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Preface 


Differential geometry studies properties of curves, surfaces, and smooth manifolds 
by methods of mathematical analysis. Riemannian geometry is a section of dif- 
ferential geometry which studies smooth manifolds with an additional structure, 
Riemannian metric. The main part of this book is devoted to exactly Riemannian 
geometry. The exception is affine differential geometry, projective differential 
geometry, and connections more general than the Levi-Civitaé connection, which 
originates from a Riemannian metric. 

The book begins with the simplest object of differential geometry, curves in the 
plane. The most important characteristic of a curve at a given point is curvature. 
The first chapter considers both local properties of curvature (the Frenet—Serret 
formula and osculating circles) and global ones (the total curvature of a closed 
curve and the four-vertex theorem). The total oriented curvature of a closed curve 
is invariant with respect to a regular homotopy, and vice versa: if the total oriented 
curvatures of two curves are equal, then these curves are regularly homotopic (this 
is the Whitney—Graustein theorem). To each curve corresponds its evolute, that is, 
the locus of centers of all osculating circles of this curve, that is, the envelope of the 
family of normals. With respect to its evolute, the given curve is an involute. But the 
inverse operation of assigning an involute to a curve is ambiguous: every curve has 
a whole family of involutes (a curve orthogonal to a family of normals can be drawn 
through every point of a normal). We give two proofs of the isoperimetric inequality 
between the length of a closed curve without self-intersections and the area which 
it bounds. A part of the differential geometry of plane curves is not related to a 
Riemannian metric, that is, remains beyond the scope of Riemannian geometry. It 
includes enveloping families of curves, affine unimodular differential geometry, and 
projective differential geometry. The chapter about plane curves is concluded by 
elements of integral geometry: we derive a formula expressing the measure of a set 
of straight lines intersecting a given curve. 

The second chapter studies curves in spaces, first in three-dimensional space 
and then in many-dimensional ones. Given a curve in three-dimensional space, we 
define curvature and torsion, derive the Frenet—Serret formula, and define osculating 
planes and spheres. We also define the total curvature of a closed curve and prove 
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Fenchel’s theorem (that the total curvature of a closed curve is at least 277) and 
the Fary-Milnor theorem (that the total curvature of a knotted closed curve is at 
least 477). At the end of the chapter, we consider curves in many-dimensional space, 
define quantities generalizing curvature and torsion for such curves, and derive the 
generalized Frenet—Serret formulas. 

The third chapter is devoted to surfaces in three-dimensional space. On a surface 
in R®, the first quadratic form is introduced; this is the inner product of tangent 
vectors to the surface. With a curve on a surface, we associate a Darboux frame 
and use it to define the geodesic curvature, the normal curvature, and the geodesic 
torsion of a curve on a surface. A geodesic on a surface is a curve with zero geodesic 
curvature; any shortest curve on a surface joining two given points is geodesic. 
On a surface in R°, the second quadratic form is also defined. In a basis with 
respect to which the matrix of the first quadratic form is the identity matrix and 
the matrix of the second quadratic form is diagonal, the diagonal elements of the 
second quadratic form are the principal curvatures. The Gaussian curvature of a 
surface is the product of principal curvatures. The principal curvatures cannot be 
expressed only in terms of the first quadratic form, but the Gaussian curvature can. 
The Gaussian curvature of a surface can also be defined in a different way, in terms 
of differential forms on the surface. The integral of the Gaussian curvature over 
a polygon on the surface can be related to the integral of the geodesic curvature 
over the boundary of this polygon and the sum of exterior angles of the polygon 
(the Gauss—Bonnet formula). We define the parallel transport of a vector along a 
curve and the covariant differentiation of vector fields and introduce the Riemannian 
curvature tensor. Using geodesics, we define the exponential map of a tangent space 
to a surface. To study properties of geodesics, we derive the first and the second 
variation formula. Using Jacobi vector fields and conjugate points, we find out when 
the length of a geodesic is not globally minimal. We prove the theorem on the local 
isometry of surfaces of constant Gaussian curvature. At the end of the chapter, we 
introduce the Laplace—Beltrami operator, which is a generalization to surfaces of 
the Laplace operator in the plane. 

In the fourth chapter we discuss two topics, hypersurfaces in many-dimensional 
space and connections on vector bundles. The study of connections on vector 
bundles is based on certain prerequisites concerning manifolds and vector bundles 
over manifolds. Thus, beginning in Chap. 4, the reader is supposed to have 
background knowledge of manifolds, tangent vectors, differential forms, vector 
bundles over manifolds, sections of bundles, and the inverse function theorem; 
all the necessary information can be found in the books [Pr2] and [Pr3]. For a 
hypersurface in Euclidean space, we define the Weingarten operator and use it to 
introduce the second, third, etc. quadratic forms. We define connections first on 
hypersurfaces and then on any manifolds and vector bundles over manifolds. In 
parallel, we introduce geodesics with respect to a given connection. We also define 
the curvature tensor and the torsion tensor of a given connection and introduce the 
curvature matrix of a connection. 

The fifth chapter is concerned with the general theory of Riemannian manifolds. 
A Riemannian manifold is a manifold whose every tangent space is equipped with 
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An inner product (Riemannian metric). On a Riemannian manifold, there exists a 
unique torsion-free connection compatible with the Riemannian metric (the Levi- 
Civita connection). The Riemann tensor of this connection has several symmetries 
(satisfies several identities). Geodesics on Riemannian manifolds have some specific 
features in comparison with geodesics for arbitrary connections; in particular, any 
such geodesic is locally a shortest curve. A Riemannian manifold is said to be 
geodesically complete if all geodesics on this manifold can be extended without 
bound. According to the Hopf—Rinow theorem, geodesic completeness is equivalent 
to the completeness of the Riemannian manifold as a metric space. The Riemann 
tensor can be described by using the sectional curvatures corresponding to two- 
dimensional subspaces. For Riemannian submanifolds, as well as for hypersurfaces, 
we can introduce the second quadratic form and prove generalizations of Gauss’ 
and Weingarten’s formulas. An important class of Riemannian submanifolds is 
formed by totally geodesic submanifolds (a submanifold M is totally geodesic if 
each geodesic on M is also a geodesic on the ambient manifold). In the many- 
dimensional case, just as in the case of surfaces, we obtain the first and the second 
variation formulas and use them to introduce Jacobi fields and define conjugate 
points. The chapter is concluded by a discussion of the holonomy (transformations 
of the tangent space obtained by the parallel transport of vectors along closed 
curves) and an interpretation of curvature as infinitesimal holonomy. 

The sixth chapter discusses the differential geometry of Lie groups, that is, 
manifolds endowed with a group structure consistent with the smooth structure. 
With each Lie group, its Lie algebra is associated, which is the tangent space at 
the identity element in which the multiplication of elements is defined as taking the 
commutator of the left-invariant vector fields corresponding to tangent vectors. The 
Lie algebra of a Lie group is mapped to this Lie group by the exponential map. 
For a Lie group and a Lie algebra, adjoint representations are defined; the adjoint 
representation of a Lie algebra is used to define the Killing form. On a Lie group, 
there exist various connections and metrics related to the group structure in various 
ways. On a compact Lie group, invariant integration can be defined. Some properties 
of Lie groups are possessed by more general spaces, namely, by homogeneous and 
symmetric ones. 

The last (seventh) chapter is devoted to some of the applications of differential 
geometry: comparison theorems, relationship between curvature and topological 
properties of manifolds, and the Laplace operator on Riemannian manifolds. 
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k is the curvature of a curve in the plane (p. 3) or space (p. 48) 
€1, €2, €3 is the Serret—Frenet frame, p. 47 

x is the torsion of a space curve, p. 48 

E, F, and G are the coefficients of the first quadratic form, p. 66 
gij are the coefficients of the first quadratic form, p. 67 
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g’/ are the entries of the matrix inverse to the matrix (gij), p. 68 

€1, €2, €3 is the Darboux frame, p. 68 

Kg is geodesic curvature, p. 69 

kn is normal curvature, p. 69 

%g is geodesic torsion, p. 69 

L, M, and N are the coefficients of the second quadratic form, p. 72 

#7 is mean curvature, p. 73 

K is Gaussian curvature, p. 73 

ri, are the Christoffel symbols, p. 85 

S? is the sphere, p. 87 

Vw V is the covariant derivative of a vector field V in the direction of a vector field 
W, p. 93 

BR jk is the Riemannian curvature tensor, p. 98 

exp, (V) is the exponential map, p. 100 
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Chapter 1 ® 
Curves in the Plane Ghost for 


The simplest object of differential geometry is a curve in the plane. The definition 
of a curve varies between different areas of mathematics. In many cases, it is 
natural to represent a curve as the trajectory of a moving point. In doing so, 
one should distinguish between a parameterized curve y(t) = (x(t), y(t)) anda 
nonparameterized curve, which is the image of a parameterized curve, i.e., a set in 
the plane. The functions x(t) and y(t) are not arbitrary. They are usually assumed 
to be smooth. But even under this assumption, a nonparameterized curve may have 
corners. This can be avoided by requiring the derivatives x’(t) and y’(t) not to vanish 
simultaneously. In that case, the parameterized curve is said to be smooth. A smooth 
nonparameterized curve is the image of a smooth parameterized curve. 

Geometry deals with both closed and nonclosed (that is, joining two different 
points) parameterized curves. A smooth (not necessarily closed) parameterized 
curve in the plane is a map y: [a, b] > IR? such that y(t) = (x (t), y(t), where x 
and y are smooth functions, and v(t) = Ht) # 0 for all t € [a, b] (we assume that 
the derivative have finite limits att = a and t = b). A smooth parameterized curve 
y: [a,b] —> IR? is said to be closed if y(a) = y(b) and v(a) = v(b). 

The length of a curve y: [a,b] > R? can be defined as the limit of the lengths 
of polygonal chains with vertices on the curve. In more detail, we choose a partition 
a=t <t, <-:: < % = b of the interval [a, b], consider the polygonal chain 
PoP... Pn, where P; = (x(t;), y@j)), and find the limit of the lengths of these 
polygonal chains as the maximum of the numbers Af; = t; — tj; tends to zero. 

It is easy to show that the length of a curve y : [a, b] > R? is equal to 


b 
/ Je)? + o"oyde. 
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Indeed, the length of a polygonal chain Pp P| ... P, equals 
n 
Yo Gi — 41-1)? + OF — yi-1)?, 


where x; = x(t;) and y; = y(t;). Using the mean value theorem, we can rewrite 
this sum in the form )77_, Ati (x/(&))? + (y’(mi))*, where the &; and nj are some 
numbers between ¢;_; and t;. The limit of this sum has the required form. 

The area of the figure bounded by a closed curve y: [a, b] — R? without self- 
intersections can be defined as the least upper bound for the areas of polygons 
contained in it and the greatest lower bound for the areas of polygons containing 
it. A figure for which these two numbers are equal is said to be squarable. The area 
of nonsquarable figures is not defined. Any figure bounded by a smooth closed curve 
is squarable. 

The oriented area of a figure bounded by a parameterized closed curve without 
self-intersections equals the area of this figure in absolute value. When the curve 
is traversed counterclockwise, the oriented area is positive, and when it is traversed 
clockwise, the oriented area is negative. 

A formula for the oriented area of a figure bounded by a parameterized (possibly 
self-intersecting) curve can be obtained from the following formula for the oriented 
area A of the triangle with vertices (0, 0), (x1, y1), and (x2, y2): 


1 
A= 3 192 — x2y1). 


By analogy with this formula, the oriented area of a polygon with consecutive 
vertices (x;, yi), = 0, 1,...,, can be defined as 


1 n 
5 Y[@iyiti — x1419i), 
i=0 


where (Xn41, Yn+1) = (Xo, yo). The polygon may have self-intersections. 

The formula for the oriented area of a figure bounded by a closed curve 
y: [a,b] — R? can now be obtained by choosing a partition a = 9 < t) < 

- < t = b of the interval [a,b], considering a polygon PoP, ... Pn, where 
P; = (x(t), y(t)), and passing to the limit of the oriented areas of such polygons 
as the maximum of the numbers Af; = t; — t;_ tends to zero. 

We set x; = x(f;) and y; = y(t;) and use the mean value theorem: xj+; = 
Xj +x'(ni) Ati, Vier = yi + y' (&) At;. As a result, we obtain the following formula 
for the oriented area of a polygon PoP, ... Ph: 


1 n—-1 
5 DL ttiyE) — yx") Ani, 


i=0 
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where At; = tj41 — tj and &, nj € [t;, tj41]. If the numbers Az; are positive and the 
maximum among them tends to zero, then we obtain the integral expression 


1 b 
; i: (xy! — yx")dt 
a 


for the oriented area of the figure bounded by the curve y. Integrating by parts, it is 


easy to show that 
b b 
-| ya'ar = f xy'dt. 
a a 


Indeed, 


b b 
/ yx'dt + / xy'dt = (ay)|? =0, 
a a 


because the curve is closed. 
Thus, the oriented area A of a figure bounded by a smooth closed curve can be 
calculated by any of the following three equivalent formulas: 


b b 1 b 
A= -| yx'dt = i xy'dt = >| (xy! — yx’)dt. (1.1) 
a a 2 a 


Problem 1.1 A closed curve y bounds a convex figure. The endpoints of a chord 
of length a + b move on the curve y. A point M of this chord divides it in the ratio 
a: b. As the chord moves, M traces a closed curve y’. Prove that the area of the 
figure bounded by the curves y and y’ equals zab. 


1.1 Curvature and the Frenet—Serret Formulas 


Let y(t) = (x (t), y(t)) be a smooth curve. It is often convenient to replace the 
parameter ¢ by the arc length parameter s = s(t) = ifs ||u(t)|| dt, where v(t) = 
ay) The arc length parameter is the length of the arc of y enclosed between the 
points y (0) and y(f). 

For the arc length parameter, we have a = ||v(¢)||. Therefore, ay a o . a = 


= 1. 


d 
WO whence | + 


Iwo 
The endpoint of the vector v(s) = a) moves along the unit circle; therefore, 


Ly. Let n be the unit vector in R* orthogonal to v and such that the vectors v and 
n form a positively oriented basis, i.e., the rotation from v to n is counterclockwise. 
Then the vector n is determined uniquely and ats) = k(s)n. The number k(s) 
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Fig. 1.1 The change of the a 
sign of oriented curvature a, fo a 


k<0 k>0 


is called the oriented curvature of the curve y at the point y(s), and the number 
|k(s)| is called simply the curvature. Under the change of the curve orientation, the 
oriented curvature changes sign (see Fig. 1.1). 

The length of the vector v(s) equals 1; therefore, it has coordinates (cos g, sing). 
The vector n(s) obtained by rotating v(s) through 90° counterclockwise has 
coordinates (— sing, cosy). Therefore, a a oe (—sing,cosg) = fon. Thus, 
k(s) = ae. where ¢ is the angle between the velocity vector uv(s) and some constant 
vector. 


Problem 1.2 Prove that the curvature of a circle of radius R equals 1/R. 


Problem 1.2 explains why the quantity R(s) = 1/|k(s)| is called the radius of 
curvature of the curve at the point s. 


Theorem 1.1 (Frenet-Serret Formulas) /f s is the arc length parameter, then 


a = kn and a = —kov, where k is oriented curvature. 


Proof Let so be a fixed value of the parameter. The frame v(s), n(s) is obtained by 
rotating v(so), (so) through an angle ¢g(s): 


ea) _ ( cos g(s) sin ) Ear 
n(s))  \—sing(s) cos g(s)) \n(so))’ 


here ~(so) = 0. Therefore, 
9 (s9)\ _ dy (- sin g(50) cos y(s0) ) 3) 
dnis) = —(so) ‘ 
=e 4S) ds — cos p(so) — sin p(so)/ \n(s0) 


a ( 0 "| ee 
~“\-10) \n(s9))° 


HISTORICAL COMMENT The Frenet—Serret formulas for curves in_ three- 
dimensional space were derived almost simultaneously by Joseph Alfred Serret 
(1819-1885) and Jean Frédéric Frenet (1816-1900). Their papers were published 
in different issues of the same journal: Serret’s in 1851 and Frenet’s in 1852. These 
formulas are sometimes called the Frenet formulas. 


| 


1.1 Curvature and the Frenet—Serret Formulas a 


Fig. 1.2 The support line 


Problem 1.3 Prove that the curvature of a curve y(t) = (x(t), y(t)) with any 
parameterization ¢ can be calculated by 


2 = Gy _ yx!) 
a aa") 


Problem 1.4 Calculate the curvature of the ellipse x + y = | at each point. 


Problem 1.5 Suppose that a curve y and a circle of radius R are oriented positively 
(i.e., counterclockwise), are tangent at a point P, and have the same velocity vector 
at this point, and the oriented curvature of the curve at P equals k. Prove that if 
k> ze then a small neighborhood of the point P on the curve lies inside the circle, 
and if k < ze then it lies outside the circle. 


Problem 1.6 Suppose that a curve r(y) bounds a convex figure containing the 
origin O and the parameter ¢ is the angle between the Ox axis and the perpendicular 
from O to the support line at the point r(g) (see Fig. 1.2). Let p(@) be the distance 
between O and the support line. Prove that the radius of curvature at the point r(¢) 


equals p(y) + p" (9). 


The Frenet—Serret formulas suggest the following geometric interpretation of the 
curvature of a curve. Let us draw a normal through each point y(s) of a curve y 
and mark off two points at a distance ¢ (small enough) from the point y(s) on this 
normal. These points trace two curves yi,(s). Consider the arcs of these curves 
corresponding to s € [a,b]. The geometric interpretation of the curvature k(s) 
is as follows: as a,b — So, the ratio of the length of the corresponding arc of 
the curve yi,(s) to the length of the arc of the initial curve equals 1 + ek(so) or 
1 —ek(sg) in the first approximation. Indeed, we have yi,¢(s) = y(s)+en(s), where 
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the vector n(s) is defined by ays) = k(s)n(s), s being the arc length parameter. 


According to the Frenet—Serret formulas, we have ants) = —k(s)v(s); therefore, 
dyes) = u(s)Fek(s)v(s) tek’ (s)n(s). The summand ek’ (s)n(s) is of order O(e), 


AY 
because the vectors v(s) and n(s) are orthogonal. Thus, in the first approximation, 


the length of the velocity vector of the curve yie(s) equals 1 + ek(s); therefore, 


the arc length of this curve equals rei + ek(s))ds in the first approximation. This 
gives the required equation, because the length of the arc of the initial curve equals 
b-a. 


1.2 Osculating Circles 


Consider two plane curves given by the equations y = f(x) and y = g(x). These 
curves intersect in a point (xo, yo) if f(xo) = g(xo) = yo. The curves are tangent 
(have contact of order 1) at the intersection point if f’(xo) = g’(xo). For tangent 
curves, we choose a coordinate system with Ox axis directed along the tangent line 
to these curves (then f’(xo) = g’(xo) = 0). Tangent curves have contact of order n 
if f’ (xo) = g" (x0), --., f (x0) = g™ (xo). Curves having contact of order 2 are 
said to be osculating. 

A point at which a curve and its tangent line have contact of order 2 is called 
an inflection point of this curve. A point (xo, f(xo)) of a curve y = f(x) is an 
inflection point if and only if f” (xo) = 0. 

If the curvature of a curve is nonzero at some point, then a circle osculating the 
curve at this point (osculating circle) is determined uniquely. The equation of the 
osculating circle in the case where the curve is given by an equation y = f(x) is 
derived as follows. Suppose that a circle of radius R centered at (a, b) is locally 
determined by an implicit function y = g(x), ie., g satisfies the equation 


(x — a)? + (g(x) —b)? = RP’. 
Differentiating this equation, we successively obtain 
2(x — a) + 2g'(x)(g(x) — b) = 0, 
2 
2 + 2g"(x)(g(x) — b) + 2(8'(x))" = 0. 
If the curve y = f(x) osculates the circle at a point (x9, yo), then 
(xo — a)” + (yo — by” = R?, 
(xo — a) + yo(v0 — b) = 0, 
1+ yo(vo — b) + (6) = 9, 


1.3. The Total Curvature of a Closed Plane Curve 7 


where yy = f’(xo) and yy = f” (xo). Moreover, if this system of equations for a, 
b, and R has a solution, then the converse is also true: the curve osculates the circle. 
It is easy to see that if yj 4 0, then this system of equations has a unique solution. 
Recall that we chose coordinates in which Y = 0; in these coordinates a = xo, 


b= yr sr, and R2 = Bint According to Problem 1.3, in the situation under 
Yo Yo 
Ny? 
consideration, we have k? = ts = ( yo 2, Thus, the curvature of the osculating 
Yo 


circle equals that of the curve at the point of contact. 

The center of the osculating circle is called the center of curvature of the curve. 
The center of curvature of a curve y at a point y(s) lies on the normal to y at this 
point at a distance +1/k(s) from it. The center of curvature is the point y(s) + 
rays): recall that the direction of the normal vector is chosen so that the vectors 
u(s) and n(s) form a positively oriented basis. 


Problem 1.7 Prove that the center of curvature is the limit position of the intersec- 
tion point of close normals. 


Below we give yet another interpretation of the center of curvature. Let y(s) = 
(x (s), y(s)) be a curve parameterized by arc length. Fix a point g = (xo, yo) in the 
plane and consider the function F(s) = ||y(s) —q||* on y whose value at each point 
equals the squared distance from this point to the fixed point. Clearly, 


’ dy 
F'(s) = 2(r0) —4, ~) ; 


dy 


F’ =) 
w= 2|2 


2 ay 
+2 (vw -—4q, “e) = 2(1+ (y(s) — 4, kn)). 


Therefore, so is a critical point of the function F if and only if the vector y (so)gq is 
orthogonal to the curve y at y(so), and a critical point so is degenerate if and only 
if k(so) A O and y(so)q = Kay A point q is said to be a focal point of a curve 
y if the function |/y(s) — q ||? has a degenerate critical point. If such a degenerate 
critical point is so, then the point g is uniquely determined by y(so)q = Kay” 
Geometrically, this means that the point q is the center of the circle osculating the 
curve y at y(so). Thus, q is the limit position of the intersection point of the normals 
to y at the points y (s1) and y(s2) as s} > so and s2 — so. In particular, the property 
of being focal does not depend on the choice of parameterization. 


1.3. The Total Curvature of a Closed Plane Curve 


The total oriented curvature of a closed curve y : [a, b] > R? is defined as ee kds, 
where s is the arc length parameter and k is oriented curvature of y. Recall that 
k(s) = ae, where ¢ is the angle between the velocity vector v(s) and some constant 
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Fig. 1.3. Total oriented curvature 


vector (see p. 4). Therefore, { & kds = 2xN, where N is the number of full turns 
(counted with sign) made by the velocity vector as the whole curve is traversed (see 
Fig. 1.3). 

The number WN can be calculated as follows. Consider the function g(s), s € 
[a, b]. Recall that a point so is said to be critical if ae (so) = 0, and the value of g(s) 
at a critical point is called a critical value. The measure of the set of critical values 
is zero. Let us choose a noncritical value g and consider only those s for which 
y(s) = @. For each of these s, we have either g’(s) > 0 or y’(s) < 0. In the former 
case, the velocity vector rotates in the positive direction; we denote the number 
of such s by n+(g). In the latter case, the velocity vector rotates in the negative 
direction; we denote the number of such s by n_(g). An interval of length Ag on 
which both n+ (g) and n_ (@) are constant contributes (n+ —n_) Ag to the integral of 
oriented curvature; therefore, the total oriented curvature is if iw (n+(g)—n_(¢)) dg. 


Remark It can be shown that the difference n4(g) — n_(g) does not depend on 
the chosen noncritical value yg. For example, on passage through a simple critical 
point, either the numbers n+ and n_ do not change or one of them increases by 1 
and the other decreases by |. Thus, the number of turns of the velocity vector equals 
n(g) — n_(@) for any noncritical value g. 


Theorem 1.2 (Umlaufsatz) The total oriented curvature of a smooth closed curve 
without self-intersections equals +27. 


Proof Let us translate a straight line not intersecting the given curve y(s) until it 
touches this curve at some point. Note that the curve lies on one side of the line. 
Thus, we can assume that the coordinate system and the natural parameter s ¢€ [0, /] 
on the curve y(s) are chosen so that the curve lies in the upper half-plane and y (0) 
is the origin. 

Let T be the triangle consisting of all points (s1, 52) whose coordinates satisfy 
the inequalities 0 < sj < s2 < /. Consider the map y of T to the unit circle S! 
defined as follows: if s; 4 sz and (51, s2) 4 (0, 1), then 


pu tS y'(0) 
as 1S, Ope 
Vora: yO. =~ THO 


ac Ion 
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Fig. 1.4 The family of B(0,1) Cl) 
curves 


A(0, 0) 


For a curve without self-intersections, this map is well-defined (does not involve 
division by zero) and continuous. Its continuity at the point (s, 5) is obvious, and to 
prove continuity at the point (0, /), it suffices to note that 


yOH-yves) —_,. y)-—y@) 
im ——_— = — lim ————__., 
s>0 lly —y@O)l s>0 lly(s) — YO) 


Let A(O, 0), B(O,/), and C(/,/) be the vertices of T. To the parameter s the 
restriction of y to the side AC assigns the tangent vector y’(s). Therefore, the point 
w(s, s) makes as many turns while traversing AC as the velocity vector does while 
traversing the curve. 

The curve consisting of the segment AC can be connected to that consisting of 
the segments A B and BC through a continuous family of curves (see Fig. 1.4). Thus, 
it remains to prove that, while traversing AB and BC, a point on the circle makes 
one turn (in the positive or negative direction). 

As the segment AB is traversed, the vector y (s) — y (0) applied to origin remains 
in the same half-plane and changes its direction to the opposite one; therefore, it 
makes a half-turn. While traversing BC, the vector y(/) — y(s) = y(O) — y(s) 
makes a half-turn in the same direction. As a result, the vector makes one full turn. 

oO 


HISTORICAL COMMENT Umlaufsatz was proved by Heinz Hopf (1894-1971) in 
1933. 


Together with the total oriented curvature / ? kds one can consider the total 


curvature ie |k| ds. The contributions of all passages through a noncritical value 
g to the total curvature are the same, independently of their sign; therefore, the total 
curvature equals i n(g)dg, where n(g) = n4(~) +n_(@) is the number of curve 
points at which the velocity vector has given direction. 

A closed plane curve is said to be strictly convex if any straight line intersects it 
in at most two points. A curve is strictly convex if and only if n(g) = 1 for all g. 
A closed plane curve is convex if it lies on one side of any tangent line. A convex 
curve differs from a strictly convex curve in that it may have rectilinear fragments. 
A curve is convex if and only if n(@) = 1 for all noncritical values g. 


10 1 Curves in the Plane 


Problem 1.8 Prove that a smooth closed curve without self-intersections is convex 
if and only if its oriented curvature does not change sign. 


1.4 Four-Vertex Theorem 


A vertex of a closed curve is a point of local maximum or minimum of its oriented 
curvature. Every closed curve has at least two vertices—global maximum and 
minimum points. It is easy to give examples of closed curves with precisely two 
vertices (see Fig. 1.5). But any curve without self-intersections has at least four 
vertices. First, we give a simple proof of a weaker statement (for only convex curves, 
rather than for all non-self-intersecting ones). 


Theorem 1.3 Any smooth convex closed curve has at least four vertices. 


Proof First, we prove that if y : [a,b] > R2 is a smooth closed curve, then 


b b b 
/ k'(s) ds = 0, / x(s)k'(s)ds = 0, / y(s)k'(s) ds = 0. 


The first equation holds because k(b) = k(a). To prove the other two, we apply 
integration by parts and the second Frenet—Serret formula: 


b b 
/ k'(s)(x(s), y(s))ds = -| k(s)(x'(s), y'(s))ds 


b b 
--f kvas = [ al at 
a a as 


Fig. 1.5 Curves with two 
vertices 
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Now suppose that y is a smooth convex closed curve. At the points of maximum 
and minimum of its oriented curvature, the derivative k’ changes sign. Consider the 
line px + gy +r = 0 joining a point of maximum of the oriented curvature to a 
point of minimum. Suppose that the derivative k’ does not change sign when passing 
through all other points. Then the product k’(px + gy +r) does not change sign on 
either of the arcs of the curve y into which y is divided by the line px +qy+r = 0, 
because both factors k’ and px + gy +r change sign at the points of maximum 
and minimum and at no other points. The constancy of the sign of k’(px + qy +r) 
contradicts the relation : k'(px + qy +r) = 0; hence there is at least one more 
point at which the derivative k’ changes sign. 

Thus, the function k’ changes sign at the points of maximum and minimum of the 
function k and at some point or points. It follows from the closedness of the curve 
that the number of sign changes must be even (if it is finite). Therefore, k’ changes 
sign at at least four points. Oo 


The proof of the four-vertex theorem for closed curves without self-intersections 
given below was suggested by Osserman [Os]. This proof uses properties of a circle 
circumscribed about a curve, i.e., a circle of smallest radius enclosing the curve. 


Lemma 


(a) Given a smooth closed curve y, there exists a unique circle C of smallest radius 
enclosing it (the circle circumscribed about the curve). 

(b) If the length of an arc of the circle circumscribed about a curve y is larger than 
half the length of the circle, then this arc contains at least one point of y. 


Proof 


(a) Given two circles of radius R enclosing the curve y, we can construct a circle 
of radius smaller than R also enclosing this curve (see Fig. 1.6). 

(b) Suppose that a circle enclosing y has an arc which is longer than half the length 
of the circle and does not intersect y. Then a translate of this circle contains no 
points of y; therefore, its radius can be decreased. 

| 


Fig. 1.6 Decreasing the 
radius of the circle 
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Theorem 1.4 Let C be the circle circumscribed about a smooth closed curve y 
without self-intersections. Then C contains at least two points of y, and if C 
contains n points of y, then y has at least 2n vertices. 


Proof Let P, ..., Py be the common points of the curve y and the circumscribed 
circle C numbered so that P| ... P, is a convex polygon. Obviously, n > 2. We 
assume that the curve y and the circumscribed circle C are oriented positively 
(counterclockwise). According to Problem 1.5 (see p. 5), we have k(P;) > 1/R, 
where R is the radius of C. It suffices to prove that each arc P; P;; has a point Q; 
for which k(Q;) < 1/R. Indeed, if this is so, then the oriented curvature has a local 
minimum between the points P; and P;+1 and a local maximum between the points 
Q; and Qi4+1. 

The required point Q; can be found as follows. Consider the arc y; between P; 
and P;,,. We can assume that the direction from P; to P;+1 is positive on both the 
arc y; of y and the corresponding are C; of C. According to the lemma, the arc C; is 
no longer than half the circle. We assume that the straight line P; P;+1 is vertical and 
the center of C lies on the left of this line (or on the very line). Choose an interior 
point Q on the arc y; which lies on the right of the line P; P;+1 (see Fig. 1.7). Then 
the radius R’ of the circumcircle C’ of the triangle P; Q P;+ is greater than R. 


Fig. 1.7 The choice of the 
points Q and Q; 


(ed 
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Let us move the circle C’ to the left until it takes off the arc y;. At the last moment 
the circle is tangent to the arc at some point Q; (see Fig. 1.7). The curve y has no 
self-intersections; therefore, at the point Q; the orientation of y coincides with that 
of the circle. It follows that k(Q;) < 1/R’. Indeed, if the oriented curvature of the 
curve y at Q; is positive, then k(Q;) < 1/R’ according to Problem 1.5, and if 
k(Q;) < 0, then k(Q;) < 1/R’ because R’ is positive. Thus, k(Q;) < 1/R’ < 1/R, 
as required. Oo 


HISTORICAL COMMENT The four-vertex theorem for convex curves was proved 
by the Indian mathematician Syamadas Mukhopadhyaya (1866-1937) in 1909. 


It can be proved that if a self-intersecting curve bounds a two-dimensional 
surface immersed in the plane, then this curve has at least four vertices (see [Pi]), 
and if the immersed surface is not a disk, then it has at least six vertices (see [Cal] 
or [Um]). 


1.5 The Natural Equation of a Plane Curve 


Let y(s) = (x (s), y(s)) be a smooth plane curve, where s is the natural 
parameter. Denoting by k(s) the oriented curvature at s, we obtain the function k(s) 
corresponding to the curve y. 


Theorem 1.5 Any smooth function k(s) corresponds to some curve y, and this 
curve is determined up to a direct motion of the plane. 


Proof It follows from the Frenet-Serret formulas v’ = kn and n’ = —kv that both 
functions €|,(s) = x’(s) and &1,(s) = y’(s) must satisfy the system of differential 
equations Ei (s) = k(s)é&(s), E5(s) = —k(s)&(s). Let (&€1,, &2,) be its solution 
with initial condition (&), (0), &2,(0)) = (1, 0), and let (€1,, &2y) be the solution 
with initial condition (&) (0), &2y(0)) = (0, 1). We set 


xis) = [ &1x(t)dt, ys) =f E1y(t)dt 


and consider the vectors v(s) = (€1x, €1y) and n(s) = (&2y, &2y). Clearly, v(s) = 
(x'(s), y’(s)) and v'(s) = kn(s); therefore, it suffices to check that the vectors v(s) 
and n(s) have unit length. 

Suppose that each of the indices a and b takes the value x or y. Then (€¢&15 + 
Exq&2p)! = kérabiy + kEla€an — kE1aE — kéxa&1yp = 0. For s = O, the matrix 
G (s) 82x ) equals € ‘). The rows of this matrix are mutually orthogonal for 

E1y(s) &2y(s) 01 
all s, that is, the matrix is orthogonal, so that its columns are mutually orthogonal as 
well. This means that v and n are orthogonal vectors of unit length. 

Now let us prove the uniqueness of the curve. Let y(s) and y(s) be two curves 
corresponding to the same function k(s). Suppose that at s = O the vector v = 
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(1x, €1y) coincides with v = = (Gis E1y) and the vector n = (&x, &2y) coincides 
with 1 = (x, by). Since (E1a&1p + frakop)! = = 0, it follows that, for all s, we have 
E1rE 1, + Bax8ox = = | and StySty zr foyboy = = 1; thus, (1x _ Ex)? a (2x _— Ey)? + 

(E1y — &1y)? + (Gy — &y)? = 0. Oo 


The natural equation of a plane curve with natural parameter s is the specification 
of the curve by the oriented curvature function k(s). Theorem 1.5 shows that a 
natural equation with smooth function k(s) always has a solution and the natural 
equation of a curve determines this curve uniquely up to a direct motion. 


1.6 Whitney—Graustein Theorem 


Let yo, y1 : [0, 1] > R? be smooth closed curves. We say that the curves yo and y; 
are regularly homotopic if there exists a family of smooth closed curves y, smoothly 
depending on ¢ € [0, 1] (and such that y, = yo fort = O and y, = y fort = 1). 
Smooth dependence on t means that the map (s, t) FH y;(s) is smooth as a map 
from [0, 1] x [0, 1] to R?. 


Theorem 1.6 (Whitney—Graustein) Curves yo and y, are regularly homotopic if 
and only if they have equal total oriented curvatures. 


Proof Recall that the total oriented curvature of a plane closed curve equals 27 N, 
where WN is the number of full turns (counted with sign) made by the velocity vector 
as the whole curve is traversed. Let , be a regular homotopy between the curves 
yo and y;, and let 27 N; be the total curvature of y,. The integer N; continuously 
depends on ¢. Therefore, it is constant, and hence No = N}. 

Now suppose that yo and y; are smooth closed curves of the same total curvature 
N. Using a regular homotopy, we can replace each of the curves yo and y; by acurve 
of length | with initial point at the origin and initial velocity vector (1, 0). Therefore, 
we can assume that s € [0, 1] is the natural parameter, yo(0) = v1 (0) = (0, 0), and 
7 (0) 4 Yt (0) = (1, 0). We can write the velocity vectors of the curves yo and yy 
in the forms vo(s) = e/) and v1 (s) = e!f!), where yo(0) = y (0) = O and 
go(1) = g1(1) = 2rN. Let us set g;(s) = (1 — t)go(s) + tg) (s) and consider the 
curve y; with velocity vector u;(s) = e'@: 7,(s) = is e@@dr, ForO <t <1, 
the curve } is not necessarily closed, but we can use it to construct the closed 
curve y;(s) = %(s) — 5%, (1) = fy lM dt — s i e'% dz. Tt remains to check 
that the curve 7 is smooth, i.e., 4) — oy (O) and fy (s) ~ 0. Clearly, 
4 yj(s) = ef) — ie el@Odr = v,(s) — 7;(1). The velocities at s = 0 and at 
s = | are equal because g,;(0) = 0 and g;(1) = 27 N, whence v;(0) = v;(1). To 
show that v;(s) 4 (1), it suffices to note that ||v;(s)|| = 1, while ||¥%(1)|| < 1, 


since ||7;(1)|| = fo elede| < i |e’? | dr < 1 and the function e” is not 
constant. o 
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HISTORICAL COMMENT The proof of the Whitney—Graustein theorem was pub- 
lished in 1937 in a paper by Hassler Whitney (1907-1989) with a note that the 
statement and the full proof of this theorem was communicated to him by William 
Caspar Graustein (1888-1941). 


1.7. Tube Area and Steiner’s Formula 


Let y : [a, b] > R? be a smooth curve without self-intersections. We assume that 
the parameterization of this curve is natural, i.e., ||y’(s)|| = 1. Suppose that, at each 
point of the curve, the Frenet—Serret frame v(s) = y’(s), n(s) is given. For each «, 
we can consider the curve y,(s) = y(s) + en(s). A tube of diameter 2r is the set 
of points swept out by the curves y, with —r < ¢ < r. We can also consider the 
positive tube (0 < € <r) and the negative tube (—r < € < 0). 


Theorem 1.7 For sufficiently small r, the areas of the positive and negative tubes 
are, respectively, 


r2 b r2 b 
rL—- a) k(s)ds and rL+ =f k(s) ds, 
2 Ja 2 Ja 


where L the length of the curve and k is its oriented curvature. 


Proof For sufficiently small r, the curves y, with different ¢ € [—r, r] are pairwise 
disjoint and have no self-intersections. Therefore, the area of the positive tube equals 
ie Lede, where L, is the length of y,. The Frenet—Serret formulas imply 


dy.(s) _ dy(s) on) 
ds ds ds 


= v(s) — ek(s)v(s); 


therefore, for |e] < |1/k(s)|, 


b b 
Le = / (1 — ek(s))ds = L— | k(s) ds. 


Thus, the area of the positive tube equals 


r b r2 b 
i (z _ | k(s) as) de=rLl— =| k(s) ds, 
0 a a 


and the area of the negative tube equals 


0 b 72 pb 
i (.-«f koyds) de=rb+ > | k(s) ds. 
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Corollary 1 For sufficiently small r, the area of the tube is exactly 2r L. 


Corollary 2 (Steiner’s Formula) Let B be a plane figure bounded by a smooth 
closed curve without self-intersections, and let B, be the set of points of the plane at 
a distance of at most r from B. Then, for sufficiently small r, the area of the figure 
B, equals A+rL +:r?, where A is the area of B and L is the length of the curve 
bounding B. 


Proof We can assume that the orientation of the curve bounding B is positive. Then 
the total curvature of this curve equals 277, and for sufficiently small r, the figure B, 
is composed of B and the negative tube. oO 


Remark If the figure B is convex, then Steiner’s formula holds for all r > 0. 
Moreover, in the case of a convex figure, it suffices to require that the boundary 
be only piecewise smooth rather than smooth. 


HISTORICAL COMMENT Jakob Steiner (1796-1863) obtained formulas for the 
area of the r-neighborhood of a convex polygon in the plane and for the length of 
its boundary, as well as formulas for the volume of the r-neighborhood of a convex 
polyhedron in three-dimensional space and for the area of its boundary, in 1840. 


1.8 The Envelope of a Family of Curves 


Consider a family of plane curves Cy depending on a parameter a. The envelope of 
the family of curves Cy is a curve C tangent to each Cy. The envelope may have 
several connected components. For example, the envelope of the family of circles 
of fixed radius r centered on a given straight line / consists of the two straight lines 
parallel to / at a distance r from /. 

We assume that the plane is equipped with a Cartesian coordinate system Oxy 
and the curves Cy are given by equations of the form f(x, y,a@) = 0, where f isa 
smooth function. 

We will seek an equation for the envelope C in the parametric form x = ¢(@), 
y = (a), assuming that C is tangent to each Cy at the point (y (a), W(a)). 

We set x9 = ~(ao) and yo = w(ao). The tangents to the curves C and Cg, at the 
point (xo, yo) are given by the equations 


1 d 1 d 0 0 
ae ana b=) G35 Se (1.2) 
x—x9 da y-—yo da Ox dy 


These lines must coincide, which means that 


af dy, af dy _ 


cae i alge ge ee (1.3) 
ox da dy da 
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The point (y(@), v(a)) lies on the curve Cg, whence f(g@), w(a@), a) = 0 for all 
a. Differentiating this equation, we obtain 


df dg of dw of _ 


-—+—. —=0. 1.4 
ox da ody da da a a) 


Now, taking into account (1.3), we see that of = 0. Thus, the envelope (if it exists 
and is specified parametrically as above) can be found by eliminating the parameter 
a from the system equations 


0 
f(x, y,a@) =0, se, y,a) =0. (1.5) 
a 


For equations (1.2) to indeed determine tangent lines, we must require that the 
derivatives ar and at not vanish simultaneously at the point (xo, yo, ao). If this 
condition is satisfied, then the argument can be reversed, and the curve found by 
solving system (1.5) is indeed the envelope. 


Example 1.1 The equation for the envelope of the family of curves 
(x — a)? + (y — a)? = 3x — a) (y — a») 


is satisfied by the line x = a, y = a, i.e., x = y. However, this line is not the 
envelope; it consists of the self-intersection points of curves in this family. 


In many cases, the envelope can be found from the following geometric 
considerations. Suppose that each pair of curves Cy, and Cy, (a1 ¥ a2) intersect 
in one point and the intersection points converge to some point (x (a), y(a)) as 
a, — a and a2 — a. Then this point (x (a), y(a)) satisfies the equations for the 
envelope. Indeed, if f(x, y,a1) = Oand f(x, y, a2) = 0, then 


= f(x, y,a1) — f(x, y, a2) = of 


a, — a2 da 


0 (ey 


where a” is a point between a, and a2. Therefore, the point (x (a), y(a)) satisfies 
the required equations 


d 
f (x(a), y(@), «) = Oand (x(a), y(@), aw) =0. 


Problem 1.9 Find the envelope of the family of lines cutting off a triangle of area 
a’/2 from a given right angle. 


Problem 1.10 Let A and B be fixed points on the arms of an angle with vertex O 
and suppose that on the segments OA and OB points A; and B, are chosen so that 
OB, : B}B = AA, : AO. Prove that the envelope of the family of straight lines 
A,B, is an arc of a parabola. 
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Fig. 1.8 Astroid 


Problem 1.11 Find the envelope of the trajectories of a point particle shot from the 
origin with a speed of vo in a fixed vertical plane. 


The envelope in Problem 1.11 is called the parabola of safety. 


Problem 1.12 Prove that the envelope of the family of straight lines cutting out a 
segment of constant length / on the coordinate axes (see Fig. 1.8) is given by the 
equation 


x23 4 y2/3 = 2/3, 


The envelope in Problem 1.12 is called an astroid. 

An astroid is a representative of the important class of curves called hypocycloids 
and epicycloids. A hypocycloid is the trajectory of a marked point on a circle of 
radius r which rolls on an immovable circle of radius R, remaining inside the 
immovable circle, and an epicycloid is the trajectory of a marked point on a circle 
of radius r which rolls on an immovable circle of radius R, remaining outside this 
circle. 

First, we derive a parametric representation of a hypocycloid, i.e., the trajectory 
of a marked point on a circle of radius r rolling inside a circle of radius R. This 
motion of the marked point can be represented as the revolution of the center of 
the smaller circle along the circle of radius r; = R — r at an angular speed of @ 
and the rotation of the smaller circle of radius r2 = r at an angular speed of wp. 
The quantities @; and w2 have opposite signs and are related by (71) + r2)@1 = 
r2(—@2 + @1), which expresses the equality of the arcs of the immovable circle of 
radius R = r; +r2 and the moving circle of radius r = rp (it rolls without slipping). 
After reduction this equation takes the form 71w| = —r2a. 
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The trajectory of the marked point is given parametrically by 


xX=r,coSa@jt+r2cosadt, 


y=r,sin@t+r2sinaot. 


We set w; = 1 and substitute r} = R—r,rm =r,and a = =F =— Ror Asa 


result, we obtain the following parametric representation of a hypocycloid: 


R-r 
x = (R—r)cost +rcos t), 


. , R-r 
y = (R—r)sin¢t —rsin t}. 
r 


In the same way we obtain a parametric representation of the trajectory in the case 
where the circle of radius r rolls on the circle of radius R outside. Then the motion 
of the marked point can be represented as the revolution of the center of the circle 
of radius r2 = r along the circle of radius r} = R-+ 7 r at an angular speed of @; and 
the rotation of the circle of radius r2 = r at an angular speed of w2. The quantities 
@ and w2 have the same sign and are related by (r1 — r2)@1 = r2(@2 — @). After 
reduction this equation takes the form r}@; = 12@2. As a result, we obtain the 
following parametric representation of an epicycloid: 


<<") 
t], 
a 


7 ; R+r 
y =(R-+7r)sint +r sin t}. 
r 


x= (R41) cost +r eos( 


In both cases, at tf = 0, for the marked point we took the point corresponding to 
the zero angle on both circles. For the hypocycloid, this is the point of tangency of 
the circles, while for the epicycloid, this is the point farthest from the immovable 
circle. For the epicycloid, we might also take the point of tangency for the initial 
point. In that case, the parametric representation of the epicycloid would have the 
form 


R+r 
x = (R+1r)cost —rcos t), 
r 


, ; R+r 
y =(R-+7r)sin¢t —rsin t}. 
r 
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If the number r/R is rational, then the corresponding hypo- and epicycloids are 
closed curves with finitely many singular points (cusps). Some of them have special 
names. For example, an epicycloid with R = r (one cusp) is called a cardioid 
(because its shape resembles a heart), and an epicycloid with R = 2r (two cusps) 
is called a nephroid (because it resembles a kidney). Problem 1.13 shows that a 
hypocycloid with four cusps is an astroid. 


Problem 1.13 Prove that an astroid is the trajectory of a marked point on a circle 
of radius 1/4 which rolls within an immovable circle of radius 1. 


Problem 1.14 


(a) Choose a number k ¥ | and consider the family of straight lines throught the 
pairs of points e!? and e'*?, Prove that the envelope of this family of lines is a 
hypo- or epicycloid. 

(b) For each integer k ~ 1, find the number of cusps of the envelope. 


Problem 1.15 Prove that the envelope of the family of straight lines being the 
reflection of a pencil of parallel rays from a circular mirror is a nephroid (to be 
more precise, one half of a nephroid: see Fig. 1.9). 


Problem 1.16 Consider a circle S$ on which a point A is chosed. From the point A 
rays of light emanate and are reflected by the circle. Prove that the envelope of the 
reflected rays is a cardioid (see Fig. 1.10). 


Problem 1.17 Consider a circle S on which a point A is chosen. Prove that the 
envelope of the family of circles centered on S and passing through A is a cardioid. 


Fig. 1.9 Nephroid 
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Fig. 1.10 Cardioid 


1.9 Evolute and Involute 


The evolute E(s) of a curve y(s) is the set of all centers of curvature (i.e., focal 
points) of this curve. Since the center of curvature is the limit position of the 
intersection points of close normals to the curve, the evolute of a curve can also 
be defined as the envelope of the family of all normals to this curve. 

A curve y(s) with evolute E(s) is called an involute of the curve E(s). Each 
curve has only one evolute but infinitely many involutes. To construct one of the 
involutes of a curve E(s), we must consider the family of tangents to this curve, 
choose any point on one of the tangents, and construct its trajectory orthogonal to 
the tangents. 

The extrema of the curvature of a curve correspond to the singular points (cusps) 
of its evolute (see Fig. 1.11). 

The evolute E(s) of a curve y(s) can be defined parametrically by E(s) = y(s)+ 
R(s)n(s), where R(s) is the radius of curvature and n(s) is a normal vector. We 
assume that the parameterization of the curve y(s) is natural. Then, according to 


the Frenet—Serret formula, we have f(s) = —k(s)v(s) = — ORLOE therefore, 


dE. dy. dR cod 
7 = as) + Gy ns) + (s)7_() 
dR dR 
= v(s) + Fe on) — u(s) = —(s)n(s). 
Ss ds 


Thus, the length of the arc of E(s) between points with parameters sj and s2 equals 


I, 
S] 


dR 


—| ds. 
ds 
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Fig. 1.11 The evolute ina 
neighborhood of a curvature 
extremum 


If the curvature of y(s) is strictly monotone on this arc, then its length equals 


1 
k(s1) — k(s1) 


|R(s1) — R(s2)| = | 


Problem 1.18 Prove that if the curvature of a curve is monotone and does not 
change sign on an arc of this curve, then the osculating circles corresponding to 
the points of the arc are disjoint. 


HISTORICAL COMMENT The statement formulated in Problem 1.18 was proved by 
Peter Guthrie Tait (1831-1901) in 1895. 


Problem 1.19 


(a) Given an ellipse x + » = |, find its focal point corresponding to a point 
(a cos fo, D sin tg). 
(b) Find an equation of the evolute of an ellipse. 


Problem 1.20 Prove that the evolute of a parabola (f, t?) is the curve 
(- 43 a) 
J 2 - 

A cycloid is the curve traced by a marked point of a circle rolling along a fixed 


straight line. 


Problem 1.21 Prove that the evolute of a cycloid is a like cycloid (a translate of the 
given one). 
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1.10 Isoperimetric Inequality 


The isoperimetric inequality is the following inequality between the length L of 
a closed curve without self-intersections and the area A of the figure which it 
bounds: L? > 4s: A; the equality is attained only when the given curve is a circle. 
This inequality has various many-dimensional generalizations, which are often also 
called isoperimetric inequalities. 


The First Proof of the Isoperimetric Inequality Let y(s) = (x(s), y(s)) be a 
given curve with natural parameter s € [0, LZ]; we will assume that the curve 
has positive orientation, i.e., is traversed in the counterclockwise direction. Let us 
draw two parallel support lines of y, choosing their direction so that y contain no 
segments parallel to this direction. In the strip formed by these support lines we 
inscribe a circle y not intersecting y (see Fig. 1.12). We assume that the coordinate 
system is chosen so that the origin O coincides with the center of y and the Ox 
axis 1s perpendicular to the support lines. Let us parameterize the circle y by the 
same parameter s so that each point y(s) = (x(s), y(s)) satisfies the equation 
x(s) = x(s) and the circle is traversed counterclockwise (see Fig. 1.12). For the 
circle, the parameter s is not necessarily natural. 

Recall that, according to (1.1) (see p. 3), the area A of the figure bounded by a 
smooth curve y: [a, b] > R?, y(t) = (x (t), y(t), can be calculated by any of the 
three equivalent formulas 


b b 1 b 
A= -{ yx'dt = / xy'dt = >| (xy’ — yx’)dt. 
a a 2 a 


Thus, for the areas A and A of the figures bounded by y and 7, we have the 


expressions A = i. xy’ds and A = — i ¥x/ds. It is also clear that A = zr’, 


where r is the radius of the circle y. Therefore, taking into account the equations 
X(s) = x(s) and (x’)? + ( y’ )? = 1 (recall that s is the natural parameter), we obtain 


L L 
A+ 2 = / ’_ Sx" < | , sy 2d 
mr : (xy — yx')ds : / (xy! — yx')-ds 
L L 
< / y (4? +. 97) (x)? + (ds = / yf (x? + y?)ds = Lr. 
0 0 


Let us combine the inequality proved above and the arithmetic-geometric mean 
inequality: 


1 1 
VA-Var2< fA +ar’)< alr 


This inequality is easily transformed into the required form 4Az < L?. 
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oe The curve and the (a(s) , y(s)) 


K 


or > 


Now let us prove that the equality is attained only in the case where y is a circle. 
First, the arithmetic-geometric mean inequality must turn into an equality, so that 
we must have A = zr’, ie., L = 2zr. (This implies, in particular, that the radius r 
does not depend on the direction of the support lines.) Secondly, we must have 


(xy! — px’)? =? + 9 )(0')? + (y'”), 
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ie., (xx’ + yy’)? = 0. Therefore, 


5_, V¥G@?+y?) 


x 
y! SOP +6) 


whence x = -try’. But the radius r does not depend on the direction of the support 
lines; therefore, considering the support lines perpendicular to the initial ones (and 
placing the origin at the intersection point of the axes of symmetry of the two 
perpendicular strips formed by the support lines), we obtain y = +rx’. Thus, 
x? + y? = (x)? + (yr? = 7? o 
The Second (Hurwitz’) Proof of the Isoperimetric Inequality We need Wirtinger’s 
inequality, one of whose simplest proofs is based on the theory of Fourier series. O 


Lemma (Wirtinger’s Inequality) Let f(t) be a 21-periodic continuously dif- 
ferentiable function. Suppose that i f@dt = 0. Then i (f'(t))*dt > 
ie” (f (t))*dt, and the equality holds only for f(t) = acost + bsint. 


Proof From the theory of Fourier series we obtain 


lee) 
f= > + S\(an cos nt + By sinnt), 
n=1 
[o@) 
fi@= S(nbn cosnt — nap sinnt), 


n=1 


where ap = ie f (dt = 0 by assumption. According to Parseval’s identity, 


Qn o 
[roa =e +09. 
0 n=1 


Qn oo 
[ roar = ra + op: 
0 


n=1 


therefore, 
Qn Qn oo 
/ (fo) dt — / (f(t = Yin? — 1a, + b;) = 0, 
0 0 Wa 


and the equality holds only if a, = b, =O forn > 1. Oo 


We proceed to the proof of the isoperimetric inequality itself. It suffices to 
consider the case of L = 27. In this case, it is required to prove that A < z. 
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Choose a coordinate system so that the center of mass of the curve lies on the 
Oy axis, i.e., fea x(s)ds = 0 (s is the natural parameter). We again use the area 


formula which we have already used in the first proof: A = i. ” xy'ds. It follows 
from the identity (x’)? + (y’)? = 1 that 27 = aa (60 + (y’)?)ds. Therefore, 


20 
2(n — A) = i ((x’)? + (y'? — 2xy')ds 
0 


20 20 
= / ((x')? — x)ds +f (x — y’)*ds. 
0 0 


According to Wirtinger’s inequality, the first term is nonnegative, and hence A < z, 
as required. 

The equality holds when x(s) = acoss + bsins andx — y’ = 0,ie., y(s) = 
asins — bcoss +c. These equations determine a circle. 


1.11 Affine Unimodular Differential Geometry 


The length, curvature, and normal of a curve in the plane do not change under the 
motions of this plane. For a smooth curve, similar notions can be defined so that 
they remain invariant not only under motions but also under any unimodular affine 
transformations of the plane. (An affine transformation is said to be unimodular if 
it preserves area.) Such a “length” cannot be defined for a straight line; therefore, 
the smooth curves under consideration must not only have nonvanishing velocity 
vector but also possess a certain additional nondegeneracy property; namely, at each 
point, the velocity and acceleration vectors must be linearly independent. In affine 
differential geometry such curves are called nondegenerate. 


Problem 1.22 Prove that the linear independence of the velocity and acceleration 
vectors does not depend on the choice of the parameterization of a smooth curve. 
Does the vanishing of the acceleration vector depend on the parameterization? 


Problem 1.23 Prove that the points at which the velocity and acceleration vectors 
are linearly dependent are precisely the inflection points. 


In affine unimodular geometry the simplest curve (an analogue of a straight line 
in Euclidean geometry) is a parabola. First, we define affine length for a parabola, 
and then generalize this definition to an arbitrary nondegenerate curve. 

The definition of affine length for a parabola is based on the following property 
of a parabola. Suppose that the tangents to a parabola at points a, 6, and y form 
a triangle ABC (see Fig. 1.13). Then the areas of the three triangles each of which 
is formed by two tangents and the straight line joining the points of tangency are 
related by 


Vv Aapc + ¥ Apya = VAyaB- 
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Fig. 1.13 Tangents toa 
parabola 


Let us prove this relation. A parabola can be defined parametrically as r(t) = 
a+bt+ 5ct", where a, b, and c are fixed vectors. Consider the triangle bounded 
by the line r(t))r(t2), the line passing through the point r(¢,) in the direction of 
the vector r’(t;) = b+ ct,, and the line passing through r(f) in the direction of 
r'(t2) = b+ cho. 

Recall that, given two vectors x and y in the plane, their pseudoscalar product 
x A y equals the oriented area of the parallelogram spanned by these vectors. 
Pseudoscalar product can also be defined in a different way as follows. Consider 
the three-dimensional space containing the given plane; then the vector product of 
any two vectors in this plane is perpendicular to the plane. Therefore, the vector 
product of vectors x and y in a given plane can be regarded as the number x A y 
(with a sign). 

The oriented area of the triangle under consideration is expressed in terms of 
pseudoscalar product as follows: 


1 ¢@ =r) Ara): C@) =-rG)) ar) 
2 r(t) Ar'() , 


It is easy to check that (r (#2) —r(t1)) Ar’ (ty) = 5(cAb) (t2—11)? andr’ (ty) Ar’ (t2) = 
(b A c)(t2 — t1). Therefore, the (signed) area in question equals a(b A c)(t2 — t)?. 
Let the points a, 6, and y correspond to the parameters t, f2, and #3. Then the 


numbers ¥/Aggc, \/Agya, and /Ayqag are proportional to tf — ty, t3 — ft, and 
t3 — t1, respectively, whence the required formula. 
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Problem 1.24 


(a) Consider a smooth curve bounding a convex figure and a not too large arc of 
this curve (that is, an arc containing no two points at which the tangents to the 
curve are parallel). We define the distance between points A and B of this arc 
as (A, B) = </Aapy, where X is the intersection point of the tangents at A 
and B. Prove that the distance thus defined satisfies the triangle inequality. 

(b) Suppose that, whenever a point B of this arc lies between two other points A 
and C, the equality p(A, B) + p(B, C) = p(A,C) holds. Prove that then the 
arc is an arc of a parabola. 


Now we specify a parameterization of the parabolar(t) = a+bt+ xet? similar 
to the natural parameterization of a curve as follows. Let us define a “distance” 
between two points A and B of this parabola as the cube root of the area of the 
triangle ABX, where X is the intersection point of the tangents to the parabola at 
the points A and B. We have already seen that, up to proportionality, such a “natural 
parameter” coincides with the parameter t. Now we must choose an appropriate 


proportionality coefficient. Let us replace the paramelet t by s = At, where A is a 


dr __ dr a a dr 2d?r 
constant number. Then 9; = 7; - 7; = Ag, and 7 = 4°. Therefore, 


dr dr 3dr dr 
bAc=—A—7T=K—A-. 
dt dt? ds ds? 


Thus, setting A = /b Ac = ats , we obtain ~ A ds = |. The condition 
a A a = | is invariant with respect to unimodular affine transformations; hence 
the parameter s is the required analogue of the natural parameter. The absolute value 
of the difference of the parameters corresponding to two points of the parabola is 
called the affine length of the arc between these points. 

Affine length can be defined in a similar way for any curve r(t). We replace the 


parameter t by 


Let us check that this change of parameter yields % a = 1. Clearly, 4 a= a ft 
and ds = ds (at) + ue . ae Therefore, 4 A eS = a . 4 A a (at)? + a 
dtp Oo a ar. a - Bia = ae ye (at) It remains to note that 
-1 
(4) = (#045) 
We could take the equality ag “+ = | for the definition of the parameterization 


s, but then we would have is. oye “thie existence of such a parameterization (its 
invariance with respect to unimodular affine transformations is obvious). 
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Problem 1.25 Let As be the affine length of an arc AB, and let X be the 
intersection point of the tangents at the points A and B. Prove that as A tends to 
B, the ratio ¥/A,py/As tends to 1/2. 


The parameter s determining the affine length of an arc is similar to the natural 
parameter. An affine normal can be defined by using this parameter in precisely the 
same way as the usual normal is defined by using the natural parameter. Namely, 


: . ‘ 2 : 
the affine normal is the straight line parallel to the vector r” = . In particular, 
the affine normal to a parabola at any point is parallel to the axis of this parabola. 


Problem 1.26 Take a point A on a curve and draw a chord parallel to the tangent 
at this point. Prove that the affine normal is the tangent at A to the curve formed by 
the midpoints of all such chords. 


The affine curvature of a curve is defined as follows. The parameterization s is 
chosen so that r’ A r” = 1. Differentiating this equation, we obtain r’ A r’” = 0, 
because r” A r” = 0. Thus, the vector r’” is proportional to r’. The proportionality 
coefficient with the opposite sign is the affine curvature k(s): r’”(s) = —k(s)r'(s). 
In particular, the affine curvature of a parabola identically vanishes, because the 
vector r” is constant. 


Problem 1.27 Prove that k = —r’” Ar”. (It is assumed that the parameterization is 
chosen so that r’ Ar” = 1.) 
Problem 1.28 


(a) Prove that an ellipse is a curve of constant positive affine curvature and a 
hyperbola is a curve of constant negative affine curvature. 

(b) Prove that the affine normals to an ellipse and to a hyperbola pass through their 
centers. 


Problem 1.29 Given acurve y = f(x), prove that ds = (f”)'/3dx and 


a2 MW\—8/3¢ gl 2 1 MW\—5/3 gu 
a CA al a i ae 


1.12 Projective Differential Geometry 


Projective length can be defined for a smooth nondegenerate curve in the projective 
plane. The notion of a nondegenerate curve is the same as in affine differential 
geometry: a curve is nondegenerate if the velocity and acceleration vectors are 
linearly independent at each point. But defining projective curvature requires an 
additional nondegeneracy condition: the point at which curvature is defined must 
not be sextatic (see p. 33). 
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Before proceeding to curves in the projective plane, we discuss curves on 
the projective line, because they are a nontrivial object in projective differential 
geometry. A smooth curve on the projective line is a map y: R — RP!. The 
smoothness of the curve presumes also that the velocity vector vanishes nowhere. 
For a curve given in homogeneous coordinates (x;(t) : x2(t)), this means that 
xix2 _ XyX] # 0. Indeed, we can take x; /x2 or x2/x1 for the affine coordinate, and 
in both cases, the derivative of the affine coordinate with respect to ¢ is a fraction 
with numerator +(x}.x2 — x21). 

Curves y; and y2 are considered equivalent if y2(t) = g(yi(t)), where g is a 
projective transformation. 

There is a convenient description of equivalence classes of smooth curves on 
the projective line in terms of second-order linear differential equations. Consider a 
smooth curve y(t) = (x1(f) : x2(t)) on the projective line. The functions x(t) = 
x\(t) and x(t) = x2(t) satisfy the differential equation 


x" x’ x 
" / Sm 
Xp xy, x1) = 0. 
" / 
Xy X54 X2 


By condition, x4x2 _ X5X 1 4 0; therefore, this equation can be written in the form 
" / 
x” + pix + pox = 0. 


The solution space of this equation is two-dimensional. Let y; and y2 be two 
independent solutions. Then y; = a; x; for some matrix (a! ) with nonzero 
determinant. Hence the curve (y\(t) : y2(t)) is equivalent to the curve (x;(f) : 
x2(t)). Choosing various bases in the solution space, we obtain all curves equivalent 
to the given one. 

Note that to the same curve different differential equations correspond. A point 
of the projective line does not change when both of its homogeneous coordinates 
are multiplied by the same nonzero number. We set y;(t) = A(t)x;(t), where A is 
a nowhere vanishing function. If y = Ax, then y’ = A’x + Ax’ and y” = "x + 
2X'x' + Ax”. Therefore, 


y+ ay +qoy = Ax" + (20) + qiayx’ + A" + gid’ + q2d)x. 


If we choose q1 and q2 so that 2A’ + qjA = Ap, and A” + qi’ + qod = Apo, ie., 
_ 22 
qi = pi — = and 


a 2 ! 
= —-— + 2— — pi — 
q2 7 ar oo Pls + p2, 


then y will satisfy the equation y” + giy’ + q2y = 0. 
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The equations x” + pjx'+ pox = Oand y"+q1y'+q2y = 0 determine the same 
curve. To eliminate this ambiguity, we choose A so that p; — 2h = 0, ie., 91 = 0. 
Then 


je Tg 
a ie he a 


and the differential equation takes the form 


" 1 2 I ! 
y + QO(t)y=0, where Q=p2-— girl Pt 
Consider a curve of the form (A(t) f(t) : A(t)). Let us express Q for this curve 
in terms of f. First, we choose i so that both functions x1(t) = A(t) f(t) and 
x2(t) = A(t) satisfy the equation x” + Ox = 0. Then (Af)” + Q(Af) = O and 
A” + Qd = 0, whence 22’ f’ + Af” = 0. Therefore, Ind = —5In f’ + C, 1e., 
d= C(f’)~'/?. Thus, 


qe 1 pe 3 i 2 1 eis ! 1 ¥" 2 
ae ee Gy, “ola ica | 
The quantity 2Q is called the Schwarzian derivative of the function f and denoted 
by S(f). 


The functions f(t) and g(t) = aft determine equivalent curves on the 
projective line; hence S(g) = S(f). This is one of the main properties of 


Schwarzian derivative. 

Now we pass from curves on the projective line to curves in the projective plane. 
We begin with defining projective length and projective curvature for such curves. 
Consider a smooth curve y(t) = (x1 (¢) : x2(t) : x3(¢)) in the projective plane. The 
functions xj (t), x2(t), and x3(t) satisfy the third-order differential equation 


x" x” x! x 
Ww " / 

xy Xx, xy x1 
WW " / 

Xy Xy Xy XQ 
LA " / 

Xz X3 X33 X3 


=0. 


We assume that the coefficient of x’”” does not vanish, i.e., 


" / 

x, Xp 
" / 

XZ X5 XQ] FO. 
" if 

Xz Xz X3 
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This condition means that the curve has no inflection points. Indeed, e.g., in the 


x" x! 

1 “l| 4 0. If the curve has 
ae) 
no inflection points, then the corresponding equation can be written in the form 


affine coordinates (x; : x2 : 1) we obtain the condition 


x + pix” + pox’ + p3x =0. 
Choosing various sets of independent solutions, we obtain all curves equivalent to 
the given one. 
Now we must take into account the coincidence of points with proportional 


homogeneous coordinates. Let us replace the homogeneous coordinate x by y = 
47!x. A simple calculation shows that y satisfies the equation 


ay!” + (BA! + pid) + BA" + 2pid! + prdryy” +A" + pid” + pod! + p3d)y = 0. 


If we choose A so that 3’+ pA = 0, then the equation will take the form (we again 
write x instead of y) 
Mm" 


x” + Px’! + P3x =0, (1.6) 


where P) = p> — p’ — xp} and P3 = p3 — 3p + Pi — FP 1P2. 
Now consider how Eq. (1.6) transforms under the change of variables 


(t,x) (s=f(),y =e(t) |x). 
We denote differentiation with respect to s by a dot: 
x'= gly t gf's, 
x" = g"y + (2g'f' + of" \9 + 85, 
x" — gly + 3g" f’ + 39" f” + of” )y + a(t fy + ef f)¥ + y, 


We want the coefficient of ¥ to vanish; hence we must require that g/(f’ yr 
ef’ f” =0,ie., g =c/f’. Then y will satisfy the equation 


(FP V+ Pa 2SFYGIALPS/ F=f" Pal FFU S YP AAFP FY Ty = O. 
Thus, if f is a solution of the equation 
S(f) = 5 Pa 
2 
(and if we again replace y by x and s by ¢), then we obtain 


x” 4+ Rx =0, 
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R= (P5085) / (9%. 


For what follows, we need the quantity 


where 


1 / 
P= Ps— 5Py. 


It is used to define the projective arc length element ds = ~/Pdt. For a curve 
parameterized by s, P identically equals 1. 


Problem 1.30 Prove that if P = 0 at all points of some curve, then this curve is a 
conic. 


The points of a curve at which P vanishes are called sextatic. The order of contact 
of an osculating conic at a sextatic point is higher than that at a nonsextatic point. 

In a neighborhood of a point at which P ¥ 0, we can take the projective length s 
for a parameter. Let us write Eq. (1.6) in the form 


rl d 

a + 2k cea + hx, 

ie., put Py) = 2k and P3 = h. For the chosen parameterization, we have P = 1, 1.e., 
1= P3 — 5P% = h —k’, whence h = | + k’. Thus, the equation has the form 


x 4 2kx’ + (1 +k')x =0. 


The quantity k is called the projective curvature of the curve at the given point. 


1.13 The Measure of the Set of Lines Intersecting a Given 
Curve 


The topic which we will now discuss refers to integral, rather than differential, 
geometry, but it is very closely related to the considerations of this chapter. We 
will obtain a formula relating the measure of the set of straight lines intersecting 
a given closed curve to the length of this curve (in doing so, we have to take into 
account the multiplicity of each line, that is, the number of its intersection points 
with the curve). Before deriving the formula, we must define a measure on the set 
of lines in the plane. 

To each straight line in the plane not passing through the origin we can assign the 
following two parameters p and @. From the origin O we draw the perpendicular 
O P to the line; the parameter p is the length of the segment O P, and @ is the angle 
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of rotation from the ray Ox to the ray OP. The line with given parameters p and 6 
is determined by the equation x cos@ + ysin@ = p. 

Thus, we have to define a measure on sets of points in the plane with coordinates 
(p, 9), where p > Oand0 < @ < 2z. Of course, we are interested in an invariant 
measure, which is preserved by the motions of the plane. First, we discuss how 
invariant measures in the plane look like. We will calculate the area A(S) of a set S$ 
by the formula 


A(S) = II foi pdedy: 


where f(x, y) is some function. For what functions f is the area thus defined 
preserved under motions? Let us show that only for constants. Consider a motion 
(x, y) (x, y) given by 


x=a+xcosg—ysing, y=b+xsing+ycos@¢ (1.7) 


(any direct motion of the plane can be specified in this way). Suppose that this 
motion takes a figure S to a figure S. We are interested in those functions f for 


which 
If FG, azay = ff f(x, y)dx dy, 


whatever the motion and the figure. By the change of variables formula for multiple 
integrals 


O(x, eee 
II fis. yaxdy = ff RG G a ea 


a(x, y) 
where 
a(x,y) _ |S x __|cosy —sing| _ , 
a(x, y) ay x sing cos@ : 
Thus, 


ee ee) ee SS ees 
_ f(a, ¥), VX, y)) —=—dxdy = Jf f(x, y), y@, y))dx dy; 
AY d(x, y) Ss 


therefore, we are interested in functions for which 


a FG Havas = ff FOG. 5), 9G, Dd d5 
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whatever the figure S and the motion. Since the above equality holds for any figure, 
it follows that, for any motion, we have 


F(x, Y), VX, Y)) = FA, Y)s 


so f(x, y) = const, because any point of the plane can be mapped to any other point 
by a motion. 

Now we can introduce a measure on the set of lines. The motion (1.7) takes each 
line x cos @ + ysin@ = p to the line 


x cos(@ — gy) + ysin(@ — yg) = p—acosg — bsing. 


Thus, if we associate points with coordinates (p, @) to lines, then the motion (1.7) 
takes each point (p, 0) to the point (p — acosg — bsing, 6 — ¢). It is easy to 
check that the Jacobian of this map equals | and any point corresponding to a line 
can be taken to any other point corresponding to a line by such a map. Therefore, it 
is natural to define the measure of a set S of lines as |'/' 5 4p dé. The lines passing 
through the origin can be ignored, because these lines form a set of measure zero. 


Theorem 1.8 Let y be a smooth curve of length 1. Then the measure of the set of 
lines intersecting the curve y (with multiplicity taken into account) equals 21. 


Proof First, consider the case where y is a straight line segment of length /. Since 
the measure is invariant with respect to motions, we can assume that the midpoint 
of this segment is at the origin and the segment itself lies on the Ox axis. Then the 
measure of the set of lines intersecting y equals 


2a (1/2)| cos 6| 20 ] 
[[aao= | / dp a= | —|cos@|d0 = 21. 
0 0 0 2 


Next, consider the case where y is a polygonal chain of length / with i links; let 
1; denote the length of the th link. In this case, a line may intersect several links, 
and such a line must be counted with weight n = n(p, 0) equal to the number of 
intersection points. (A line intersecting the chain in a vertex intersects two links, but 
such lines form a set of measure zero.) Applying the formula derived above to all 
links of the chain and summing, we obtain 


[[ndvas=2d1 = 21, 


1 


The general case is proved by passing to the limit. oO 


HISTORICAL COMMENT The first formulas of integral geometry were obtained by 
Morgan William Crofton (1826-1915). 
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1.14 Solutions of Problems 


1.1 If the endpoints of the chord have coordinates (x;, y,) and (x2, y2), then the 
point M has coordinates 


_ bx, +.ax2 _ by, +ay2 


a+b > at+b 


Suppose that, as ¢ varies from fo to tj, each of the points (x1(f), y:(f)) and 

(x2(t), y2(t)) traces the whole curve y. According to formula (1.1), the area A 
bounded by y equals hs yjdx, = j. y2dx2. Therefore, 

[ (ab + b*)yidxi + (a? +ab)yodxr _ (abt b*)A+(@ +ab)A _ , 

to (a+b) 7 (a+b) —— 


Let us subtract the area A’ bounded by y’ from this expression for the area A. 
Clearly, 


Ala [° Gut aybdes + adr) 
1) (a + b)2 
Therefore, the desired area A — A’ equals 


ab t 
(a+by [ (y2 — y1)(dx2 — dx). 


Let us check that ie (y2 — y1)(dx2 — dx,) = m(a + b)?. This integral equals 
the area bounded by the curve which is traced by the point with coordinates (x2 — 
x1, y2 — y,). But this figure is a disk of radius a + b, because the distance from the 
point with these coordinates to the origin equals the length of the given chord. 


1.2 A circle of radius R can be specified by the parametric equations x(t) = 
Reosot, y(t) = Rsinawt. Then v(t) = (—@Rsinwt, wRcosat). For w 
1/R, we have ||v(t)|| = 1, ie., ¢ is the natural parameter. Moreover, a — 


—w(cosat, sinwt) andk = Is Fl = =o=1/R. 


1.3 If s is the natural parameter, then ae — 4”. ds and |% L| = 1. Therefore, 


ds dt 
2 
(a) = 


we obtain 


2 
td = y” and oy =y': (y?)-/2, Differentiating with respect to f, 


"y 72 


dy ds_ oy" yy’ _ "y=, vv" 


ds? dt y? “Gy ee 
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Let us raise this equation to the second power. Taking into account the equations 


2 
d’y = k2 ds\? _ ,/2 P 
ds? | = k* and (5) = y’”, we obtain 


2 7 ey _ (y’, y"y _ (x! a yl) (x? As 9) = (x""x! ae y"y’)? 
ne) rn (x? + 793 | 


This expression is easy to reduce to the desired form. 


1.4 Consider the following parameterization of an ellipse: x(t) = acost, y(t) = 
bsint. We have 


xy! — y"x! = —ab(cos’ t + sin’ t) = —ab, 


2 2 : 
x! + y'* = a’ sin? t + b? cos’ t. 


Therefore, according to Problem 1.3, 
2 ab? 
~ (a2 sin? t + b2 cos? t)3” 


1.5 We can assume that the point of tangency is the origin and the common tangent 
vector has coordinates (1, 0). Then the center of the circle is point (0, R), so that 
the points (x, y) inside the circle are those for which xe + (y - Ry < R?, ice., 
x*+4y* < 2Ry, and the points outside the circle are those for which x7+ y? > 2Ry. 

Let y(t) = (x(t), y(t), and let the point of tangency correspond to t = 0. Then, 
by assumption, (x’(0), y’(0)) = (1,0), whence x(t) = t + 5x”1 + O(t) and 
y(t) = sy"? +00); here x” = x” (0) and y” = y”(0). Therefore, x?(t)+ y(t) = 
t? + O(t?) and 2Ry(t) = y"t? + O(t?). 

Applying the formula of Problem 1.3, we obtain k* = (y”)*. In view of the 
direction of the tangent vector, we have k = y” rather than k = —y” (for small r, 
the curve lies in the upper half-plane if k > 0 and in the lower one if k < 0). Thus, 
2Ry(t) = kRt* + O(t3). Therefore, if KR > 1, then, for small r, the inequality 
2Ry(t) > x?(t) + y(t) holds and the point (x(f), y(f)) is inside circle, and if 
KR < 1, then this point is outside the circle. 


1.6 The velocity vector is directed along the support line. Let v = (— sing, cos g) 
be the unit velocity vector, and let n = v’ = (— cos@, — sing) be the unit normal 
vector (see Fig. 1.2). Clearly, n’ = —v. 

Let s be the natural parameter. Since a = n and oe = kn, where k is the 


curvature, it follows that gs = i = R, where R is the radius of curvature. 
Differentiating the equation p = —(n,r), we obtain p’ = (—n’,r) — (n,r’) = 
(v,r) — (n,r’). The vector r’ is parallel to v, so that (n, r’) = 0 and hence p’ = 


(v,r). Differentiating, we obtain p” = (v’,r) + (v,r’) = (n,r) + (v. ar) = 
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—p+ (v, ge. 4). Here a =vand = a = t = R. Therefore, p’ = —p+ R, ie., 
R=p+p". 

1.7 We can assume that the curve is given by an equation y = f(x) and f’(0) = 0. 
We are interested in the center of curvature at the point (0, aK (0)). The normal to 
the curve at a point (e, f (e)) is determined by the equation (x — €) + f’(e) (y - 
f (e)) = 0, and the normal at the point (0, f (0)) is the coordinate axis Oy. These 


normals intersect in (0. flt+ zx): Letting ¢ tend to 0, we obtain the point 


(0. fO)+ rin): This is precisely the center of curvature. 


1.8 For a curve without self-intersections, we have i kds = +2m by virtue of 
Umlaufsatz. If the curve is convex, then f %4 [Ali Orel i k ds. Hence either 
Sf? dk +k) ds = Oor fda —k)ds = 0. But |k| +k > 0. Therefore, in the former 
case, we have k = —|k| for all points of the curve, and in the latter, we have k = |k|; 
thus, the sign of the oriented curvature k is constant. 

Conversely, suppose that the oriented curvature k of a closed curve without 
self-intersections has constant sign. Then ihe Ikids = + p? kds = 2n, ie., 


i n(g)dg = 27. The velocity vector of a curve without self-intersections makes 
a full turn; therefore, n(y) > 1 for all gy. Thus, n(g) = 1 for almost all g. But it the 
curve is nonconvex, then n(g) > 1 for ¢g in a set of nonzero measure. 


1.9 Let us introduce the coordinate system whose axes coincide with sides of the 
given right angle. The straight lines in question intersect the coordinates axes at the 
points (wa, 0) and (0,a/a), where a > 0. The line corresponding to a parameter 


a is given by the equation x + a*y = aa. Two lines with parameters a and a2 
aa\a2 a 
ay+a2’ ay+a2 


intersect in the point with coordinates ( ); Asa, > a andar > a, 


this point tends to (4 om #). Such points lie on the hyperbola xy = a*/4. 


1.10 We can assume that O is the origin, A = (1,1), and B = (—1,1). Then 
A, = (1—-a, 1—a@) and B; = (—a, a) for somea € [0, 1]. The line passing through 
the points A; and B, is determined by the equation (2a — 1)x + y = 2a(1 — a). It 
is easy to check that if (xo, yo) is the intersection point of the lines corresponding to 
parameters a; and a2, then x9 = 1—a@, —a2. Asa, — a anda2 — a, the point xo 


tends to 1 — 2a. Substituting the expression a = — into the equation of the line, 

we see that the envelope is determined by the equation —x? + y = (1 — x”), ie., 
1 2 

y= . 


1.11 Take the coordinate system with axis Oy pointing vertically upward and axis 


Ox directed along the horizontal component of the velocity vo. Then, at a moment 
2 
of time f, the particle has coordinates x(t) = vot cosa, y(t) = vot sina — &, 
where a is the angle at which it was we For example, if the particle was shot 
vo 


vertically upward, then y(t) = vot — %-, and hence y(t) is maximum at t = Z 


1.14 Solutions of Problems 39 


2 
and y(t) = z. If the envelope is a parabola, then it must be defined by the equation 


2 
y= 3 — kx*. To find k, we calculate the maximum possible coordinate of the 


intersection point of a trajectory with the Ox axis (shot range). If y(t) = O and 


Si 2v2 sina cosa v2 sin 2a 
to # 0, then to = so and x(to) = ee = 0 5 . Therefore, the 


2 
maximum range of shot equals 2. Accordingly, for k, we obtain the equation y = 
7) v2 : 
xt (2) = 0, whence we find k = ~%. 
& 2u9 


2 
Now let us prove that the parabola y = x - sox? is indeed the envelope of the 


family of trajectories. To this end, it suffices to show that any trajectory lies below 
this parabola and shares one point with it, i.e., for all t, the inequality 


vy gcos?a 5 : e 
— — +r 2 vot sina — — 
2g 2 


holds, and it turns into an equality at some t. Replacing cos* a by 1 — sin? a, we 


obtain the inequality 


2 2 


uv sin’ a 

ere ee Si 

2g 2 
. g vo . 2 . 7 vO 
Le., 5 (2 —tsin ) > 0. It turns into an equality att = ane 


1.12 Consider a segment with endpoints (a;,0) and (0, b;) of length / and a 
segment with endpoints (a2, 0) and (0, b2) of the same length /. If a = ae and 
b= Ath then a =at+a,b, =b— B,az=a-—a,andb2 = b+ B for some a 
and f. The equation (a + w)* + (b — B)* = (a—a)* + (b+ f)* gives aa = bf. 

Let us find the coordinates of the intersection point of the segments under 
consideration (we assume that a; and a2, as well as b; and b2, have the same sign). 
The straight lines containing these segments are determined by the equations 


x + y 1 x y 


—_— = 1, ——{— 1s 
ata b-— 8 faa be 


Considering the sum and the difference of these equations, we obtain 


ax by ax By 


age Bee a—o2  b2—p2’ 
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Taking into account the relation aa = bf, we can rewrite the last equation in the 


% _ y ee aches ; 
form Waa) = bop" Substituting it into the first equation, we see that 
a(a* — a”) b(b? — B”) 
aS = 
a* + b? ae 


We are interested in the limit as a — O and f — O. Clearly, the limit expressions 
are x = a>/I? and y = 53/17. Thus, 


34 2/3 = (24 py/43 = 2/ PB =P. 


1.13 In the case where R = 4r, we obtain the following parametric representation 
of the hypocycloid: 


x = 3rcost+rcos3t = 4r cos’ t, 


y = 3rsint —rsin3t = 4r sin’ t, 


whence x7/3 + y?/3 = (4r)?/3 = R?/3, 
1.14 


(a) Consider A = e!%, B = e!*?, A’ = et, and B’ = et" Let C be 
the limit position of the intersection point of the straight lines AB and A’B’ as 
a — 0. Clearly, if k > 0, then the point C lies on the segment AB, and if 
k < 0, then it lies outside this segment. Let us show that AC : CB = 1: |k|. 
Indeed, AC : CB’ = sinB’ : sinA = +sina : sinka ~ 1: +k andCB’: 
CB = sin B : sin B’ — 1. We conclude that each point C on the envelope has 
coordinates 


efkO + kel? 1 
1+k 1+k 


(coskg+kcosg, sinkg+ksing). 


It is seen from the parametric representation of a hypo- and an epicycloid (see 
p. 19) that the points with such coordinates form a hypo- or epicycloid. 

(b) Answer: |k — 1|. The cusps correspond to the pairs of diametrically opposite 
points e'? and e!*, for which e!? + e'*¥ = 0. Canceling out e'?, we obtain the 
equation e k-De — _1, It has |k — 1| solutions. 


1.15 We will assume that the rays are parallel to the axis Ox and that they are 
reflected from the unit circle. Consider the ray falling to a point A = e!”. After 
reflection this ray falls to a point A,, which is obtained as follows. Let A’ be the 
starting point of this ray of light, i.e., the point symmetric to A with respect to the 
axis Oy. Then A, is symmetric to A’ with respect to the diameter AB. A simple 
calculation of angles shows that Ay = eS¥+™) Let y = gy +a. Then3y+z7 = 
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Fig. 1.14 Cardioid 


St 


39 + 3a +m = 39+ a fora = —4. Asa result, we find ourselves in the situation 


of Problem 1.14 with k = 3, which corresponds to Aut = 3,1e., R=2r. 


1.16 We assume that the circle is unit and A = (—1, 0). Then the ray falling to a 
point e’” falls to the point e'?”+”) after reflection. Let y = gp+a. Then2y+z = 
29 +2a+ 2 =2¢+a fora = —z. As aresult, we find ourselves in the situation 
of Problem 1.14 with k = 2, which corresponds to an epicycloid with one cusp. 


1.17 Consider the circle S; equal to the given circle S and tangent to it at the point 
A. We mark the point coinciding with A on the circle $; and roll S; on the circle 
S, keeping the latter fixed. Suppose that, at some moment of time, the circles are 
tangent at a point P and the marked point of S) is A; (Fig. 1.14). Then the arcs PA 
and PA, are equal, and hence so are the segments PA and PA,. At the moment 
under consideration the motion of S; is the rotation about the point P; therefore, the 
velocity of the point A; is perpendicular to the straight line P Aj, i.e., the velocity 
vector of the curve traced by the point A; is tangent at A, to the circle of radius 
PA centered at P. The point A, traces a cardioid, and this cardioid is tangent to all 
circles in the family under consideration. 


1.18 Suppose for definiteness that the curvature radius of a curve y(s) is positive 
and decreases from a point s; to a point s2. Consider the points E(s,) and E(s2) 
being the centers of curvature corresponding to the parameters s; and s2 (see 
Fig. 1.15). The length of the arc of the evolute between E(s;) and E(s2) equals 
R(s1)— R(S2). Thus, the distance between the centers of osculating circles is smaller 
than the difference of their radii. Therefore, the circle of smaller radius lies strictly 
inside the circle of greater radius. 
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Fig. 1.15 Tait’s theorem 


1.19 
(a) Given a point (xo, yo), consider the function 
F(t) = (xo — acost)? + (yo — bsint)? 
on the ellipse. We are interested in (xo, yo) for which the function F(t) has 
degenerate critical point f9. At such a point, we have F’(to) = O and F” (to) = 0, 


1.€., 


xoa sin tg — yob cos to + (b> _ a’) sin to cos fg = 0, 


Xo0a cos to + yob sin to + (b? _ a’)(cos” to — sin? to) = 0. 
Solving this system of equations, we obtain 


a —b 3 a —b - 3 
cos” fo, yo=— sin 


to. 


(b) It is seen from the expressions for focal points that these points lie on the curve 


(ax)23 + (by)2/3 = (a? — b)?/3. 
1.20 Given a point (xo, yo), consider the function F(t) = (x9 — t)? + (yo — t7)? 
on the parabola. We are interested in (xo, yo) for which F’(to) = 0 and F” (fo) = 0, 
1.€., 


418 + 2t0(1 — 2yo) — 2x0 = 0, 
12% + 2(1 — 2yo) = 0. 
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Sy 


Fig. 1.16 Cycloid 


Solving this system of equations, we obtain 


1+ 616 
=. 


xo = —44. 


YO 


1.21 Suppose that the fixed line is the Ox axis and the marked point coincides with 
the origin O at the initial moment. We also assume that the circle has radius | and 
remains in the upper half-plane. Suppose that, at some moment of time, the marked 
point is at a point X; then the rolling circle S; is tangent to the Ox axis at a point 
P (see Fig. 1.16), and the length of the segment O P equals that of the arc PX. Let 
PQ be the diameter of the circle S;, and let O; be the point (z, 0), i.e., the midpoint 
of the segment with endpoints at the first and second positions of the marked point 
on the Ox axis. Then the length of the segment O; P equals that of the arc OX. 
Therefore, if S| is the circle symmetric to S; with respect to P and X’ and Q’ are 
its points symmetric to X and Q, then the length of the arc Q’X’ equals that of the 
segment O, P. Consider the straight line / through Q’ parallel to the given fixed line. 
Let O’ be the projection of the point O; on/. Then the length of the segment O; P 
equals that of the segment Q’O’. Therefore, the length of the arc Q’ X’ equals that 
of the segment Q’ O’, i.e., the point X’ lies on the cycloid traced by a marked point 
on a circle of radius 1 rolling on the line / toward the initial circle (this marked point 
coincides with O’ at the initial moment). 

Let us show that the line XX’ is simultaneously a normal to the first cycloid and 
a tangent to the second. The point P is the instant center of rotation of the point 
X; therefore, the line PX is normal to the first cycloid. The point Q’ is the instant 
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center of rotation of X’; therefore, the line Q’X’ is normal to the second cycloid, 
and the line PX’ is tangent to it. 


1.22 Letr’ andr” be the velocity and acceleration vectors of the curve r(t). Let us 
calculate the velocity and acceleration vectors for the same curve with a different 


parameterization r(t(s)). Clearly, oe = a : gt =r’. ge and 


@r d([, at yfaty ..» &t 
— = —[r’.—)=r"(—) +r -—. 
ds? ds ds ds ds? 


By assumption both parameterized curves are smooth, Le., gt # QO. Hence the 
2, - : : 

vectors a and a are linearly dependent if and only if so are the vectors r’ and 
r”. 5 

The acceleration vectors for the two parameterizations are r” and r” (4) +r’. 

2 . F . 

rae The parameterization can be chosen so that one of these vectors is zero and the 
other is nonzero. 


1.23 Choose a coordinate system and a parameterization so that the curve has the 
form r(t) = (t, f(t)). Then r’ = (1, f’) andr” = (0, f”). The vectors r’ and r” at 
a point (to, f (to)) are linearly dependent if and only if f” (to) = 0, i.e., this point is 
a point of inflection. 


1.24 We use the notation of Fig. 1.13 (assuming that it shows an arbitrary convex 
arc). Let 


aC a cB Ob BA c 


CB. 1-a BA 1—b’ Ay 1-c 


Then Agge = 2 Aasc = {4c — a)Ayas = abcAyag and Agya = (1 — 
a)(1 — b)(. — c)Ayap. Thus, we must prove that 


Vabe+ JA —a)\i—b)\(1—o) <1. 
This easily follows from the arithmetic-geometric mean inequality: 


b 1 Popa 
Jabe + JT= ay By = 0) < Eg A a, 


(b) It is seen from the solution of problem (a) that p(A, B)+(B, C) = p(A, C) 
if and only if a = b = c. For us it is only important that a = c. Indeed, 
according to Problem 1.10 (see p. 17), given a fixed angle aBy, the envelope of 
the corresponding family of segments AC is an arc of a parabola. 


1.25 Let A = r(s), and let B = r(s + As) =r +r/As +r” 42 +... where 
r=r(s),r/=r'(s), andr’ =r’(s). Then X =r+dAr’ =r(s+As)+pr'(s+As) 
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for some numbers A and ju. Clearly, r’(s+ As) = r’+r”As+.... Therefore, Ar’ = 
r'As + ri As)” + ur’ + pr’ As +... . Taking into account the linear independence 
of the vectors r’ and r”, we obtain A © yx+ As and w+ a ~ 0. Thus, A * As. and 
hence the area of the triangle ABX approximately equals the area of the triangle 


" (As)? 
2 


with vertices r, r + r/ Ss, andr +r/As +r . The area of this triangle is 


es A (r’As + ee =r A Aad = ay 
2 2 2 8 8 


because the parameter s is chosen so that r’ A r” = 1. 


1.26 Choose the parameter s determining the affine arc length so that A = r(0). Let 
Oxy be the coordinate system with origin at A with respect to which r’(0) = (1, 0) 
and r” = (0, 1). Then in a neighborhood of A the given curve is defined by the 
equations x(s) = 5+ ais? +ays++... and y(s) = 5s" +bis?+bos+ +... .The 
first equation gives s = x+ceyx3+cox4+...; therefore, y= 5x? +dx3+dox4+. oie 
Let us compare the curve y = 5x" + d)x3 + dyx*+ +... with the curve y = 5x. 
The distance between the points of these curves in the vertical direction is of order 
x3: hence the distance in the horizontal direction is of order x? as well. Therefore, 
the curve formed by the midpoints of the chords under consideration deviates from 
the Oy axis by a distance of order x* ~ y*/*. Such a curve is tangent to the Oy 


axis, which is precisely the affine normal, because it is directed along r”. 


1.27 The pseudoscalar multiplication of both sides of the equation r’” = —kr’ by 

r” on the right yields r’” A r” = —kr’ Ar" = —k. 

1.28 

(a) An ellipse can be represented in the parametric form r(t) = acos(At) + 
bsin(At)+c, where a, b, and c are constant vectors and A is an arbitrary nonzero 
number. A simple calculation shows that r’ Ar” = 43a Ab andr” = —2?7r'. 


Choosing 47 so that A7a A b = 1, we obtain the required parameterization. 
Moreover, k(s) = 4? is a positive constant. 
For a hyperbola, we consider the parameterization r(t) = acosh(At) + 
b sinh(At) + c, choose A so that A?7a A b = 1, and obtain k(s) = —A?. 
(b) The midpoints of the chords parallel to a diameter in an ellipse or a hyperbola 
lie on the conjugate diameter. Applying Problem 1.26, we obtain the required 
result. 


1.29 First, we take x for a parameter. Then r(x) = (x, f(x)), r’(x) = C1, f’(x)), 
and r(x) = (0, f”(x)); therefore, r’ Ar” = f(x). By definition ds = (r’ A 
r")'/P dx, whence ds = (f”)!7dx. 

To calculate k, we must pass to the parameterization by s. The formula dy a 
kf shows that it suffices to calculate the first coordinate of the vectors as and a 
Clearly, $¢ = r'4 = (f”)—'/3r'; therefore, the first coordinate of $ is (f”)~ 1/3. 
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. 2 
Next, the first coordinate of cs equals 


d 
als aoa ee 
S 


1 1 
Ci Ws Cl Ban ak = Ce Mee 


i) 
Finally, the first coordinate of the vector of equals 


d_ 1 1 
ee ed vee Geek 
1 5 
= Sa ER yee a (Ge ante dae 


eat : 3 : J 
Dividing the first coordinate of om by the first coordinate of a, we obtain the 
: S S 
required result. 


1.30 For a certain parameterization ¢ of the curve under consideration, the curve is 
given by the equation x” = 0. This equation has the three independent solutions 1, 
t, and t*. Hence the curve is projectively equivalent to the conic (1: t : t7). 


Chapter 2 ® 
Curves in Space od 


The next simplest object of differential geometry after a plane curve is a curve 
in space, three- or many-dimensional. In addition to curvature, a curve in three- 
dimensional space has one more characteristic, torsion. A space curve lies in one 
plane if and only if its torsion identically vanishes. A curve in n-dimensional space 
is characterized by numbers x1, ..., %,—1, which generalize curvature and torsion. 
Again, a curve lies in one hyperplane if and only if x,_; = 0 at all points of this 
curve. 


2.1 Curvature and Torsion: The Frenet—Serret Formulas 


Let y : [a,b] > IR? be a smooth curve, ice., y(t) = (x (t), y(@0), 2), where x, y, 


and z are smooth functions and v(t) = av@) # 0 for all ¢ € [a, b]. It is convenient 


to replace the parameter ¢ by the natural parameter s = s(t) = i, |v(t) || dt, for 
which ||v(s)] = | ay | 24, 


Suppose that ae # 0 at each point of y. Then to each point of y we can assign 
the orthonormal frame e;(s), e2(s), e3(s) defined as follows. First, e;(s) = v(s). 
The vector e2(s) is uniquely determined by the condition oe (s) = k(s)e2(s), where 
k(s) > 0. The head of the vector e;(s) moves on the unit sphere, so that e2(s) 
is orthogonal to e;(s). The pair e;, e2 of orthogonal vectors can be extended in a 
unique way to a positively oriented orthonormal basis by adding a third vector, and 
e3 is this third vector. The frame thus obtained is called the Frenet—Serret frame. 
The vector e2 is called the principal normal to the curve. 

In what follows, we usually consider regular space curves y, which satisfy the 
condition oe # 0 at each point, because the Frenet—Serret frame exists only for 
regular curves. 
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e1(s) €1 (so) 
Clearly, | e2(s) | = A(s) | e2(so) |, where A(s) is an orthogonal matrix. Let us 
e3(s) €3(s0) 


show that the matrix A’(so) = #4 (so) is skew-symmetric. Indeed, A(s)A(s)" =/ 
is the identity matrix. Therefore, A’(s)A(s)’ + A(s)A‘(s)’ = 0. Moreover, A(so) = 
I, whence A’(so) + A’(so)! = 0. Thus, 


et O a2 a13\ fer 
é, =[-aj2 O ar en]. (2.1) 
e; —a13 —a23 0 3 


But in view of the choice of the vector e2, we have e; = kep. Therefore, aj. = k 
and a1}3 = 0. The number k is called the curvature of the given space curve, and 
the number x = a3 is called the torsion of this curve. Curvature is defined for any 
smooth curves, while torsion is defined only for regular curves. 


HISTORICAL COMMENT The curvature and torsion of a space curve were intro- 
duced by Michel Ange Lancret in his 1806 paper Memoir on Curves with Double 
Curvature. He defined them in terms of infinitesimal angles of rotation of the 
normal and osculating planes. In the nineteenth century the name ‘curves of double 
curvature’ was used for space curves. 


Formulas (2.1) can be written in the form 


e} = Keo, 

es = —ke, + e3, 
/ 

e3 = —#e2. 


These formulas are known as the Frenet—Serret formulas. 


HISTORICAL COMMENT Recall that Serret and Frenet derived these formulas for 
curves in three-dimensional space almost simultaneously. 


Problem 2.1 Suppose that all normal planes to a curve y(s) pass through a fixed 
point xo € R°. Prove that this curve lies on a fixed sphere centered at xo. 


Problem 2.2 Calculate, up to sign, the curvature and the torsion of the helix y(t) = 
(Reost, Rsint, ht). 


Problem 2.3 


(a) Prove that if the curvature of a smooth curve identically vanishes, then this curve 
lies on a straight line. 

(b) Prove that if the torsion of a regular curve identically vanishes, then this curve 
lies in a plane. 
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Problem 2.4 Prove that the curvature of a curve y(t) = (x (t), y@), z(t) with any 
parameterization f is calculated by the formula 


2 _ (y’ x y")? 
re (y’?)3 
Problem 2.5 
(a) Prove that the torsion of a curve y(s) = (x(s), y(s), 2(s)) with natural 


parameterization s is calculated by the formula 


(y’ yn yl") 
a ; 


where (y’, yy", v'") = (vy, vy” xy”) is the scalar triple product of three vectors. 
(b) Prove that the torsion of a curve y(t) = (x (t), v0), z(t) with any parameteri- 
zation t is calculated by the formula 


eee) 
— (y’ x yy") . 


Problem 2.6 Let y(s) be a curve with natural parameterization on the sphere of 
radius R centered at xo. Prove that k > 1/R and if x 4 0, then y(s) = xo — pe2 — 
p'oe3, where p = 1/k ando = 1/x. (In particular, R* = p* + (p'o)*.) 


Problem 2.7 Let y(s) be a curve with natural parameterization for which p = 1/k 
and o = 1/x are defined. Prove that this curve lies entirely on a fixed sphere if and 
only if 


Problem 2.8 A curve lies on a sphere and has constant curvature. Prove that this 
curve is a circle. 


In Problem 2.7 the torsion x is required to be nonvanishing. The following two 
problems give criteria for a curve to lie on a sphere which do not assume that x ¥ 0. 
Recall that, according to Problem 2.6, the curvature k of a curve on a sphere does 
not vanish. 


Problem 2.9 [Wo2] Let y(s), s € [0,/], be a smooth curve (with natural parame- 
terization) whose curvature nowhere vanishes. Prove that this curve lies on a fixed 
sphere if and only if there exists a smooth function f(s), s € [0,/], for which 


d {1 df x 
= —(- d —+-—=0. 
fu a (z) an + 


Prove also that the radius of the sphere in this case equals , / a + f?. 
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Problem 2.10 ({Br1, Wo2]) Prove that a smooth curve y(s), s € [0, /], with natural 
parameterization lies on a sphere if and only if 


S S 
(Acos [ xd + Bsin f x ds )k(S) =| 
0 0 


for some constants A and B, and the radius of the sphere then equals V A? + B2. 


A curve y(s) is said to be of constant slope if there exists a fixed unit vector e 
for which (e, e;) = const; here ey = a is the unit velocity vector. 


Problem 2.11 Prove that a curve with k # 0 is of constant slope if and only if 
»% = ck for some constant c. 


Let y(s) be a smooth curve, where s is the natural parameter. Suppose that k(O) # 
0. Then at the point y (0) the Frenet—Serret frame e1, e2, e3 is uniquely defined. We 
have 


y'(0) = e1, 
y" (0) = kes, 


y" (0) = ken + kel, = —k*e, + ken — knee, 


Calculating the derivatives of y can be continued, because the derivatives of the 
vectors e, €2, and e3 can be expressed in terms of the functions k = k(s) and 
x = x/(s). 

If a curve y(s) = (x!(s), x?(s), +7(s)) is defined by not merely smooth but 
analytic functions x 1 x2, and x3, then the above argument shows that the functions 
k(s) and x(s) determine the curve y(s) uniquely up to a motion of Euclidean space. 
The solution of Problem 2.12 shows that this is true not only for analytic functions. 


Problem 2.12 Prove that, for any two differentiable functions k(s) and x(s), 
where k(s) > 0, there exists a unique (up to a motion) smooth curve y(s) = 
Ge (s), x2(s), x3(s)) with curvature k(s) and torsion x(s). 


Suppose in addition that %(0) 4 0 and choose coordinates x!, x”, x? associated 


with the Frenet—Serret frame at the point y (0). It follows from the equations for 
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Fig. 2.1 The projections of the curve 


the derivatives of y(s) at zero (see p. 50) that, in these coordinates, y(s) = 
(x!(s), x2(9), x3(s)), where 


263 
ioe kh... 
6 
1 k's? 
2 2 
= x<k was 
x“(s) 5 Ks + 6 + 
kus? 
3 —— 
x?(s) = G +... 


Thus, the projection of the curve y(s) on the coordinate planes associated with the 
Frenet—Serret frame have the form shown in Fig. 2.1. 


2.2 An Osculating Plane 


Let y(s) be a regular space curve with natural parameterization. Take a plane 
passing through the point y (so). We denote the distance between the points y (so) 
and y (so + As) by d(As) and the distance from the point y (so + As) to the chosen 
plane by h(As). Let us try to choose the plane so that the curve would move away 
from it as slowly as possible, or, more precisely, so as to maximize the greatest 
positive integer m for which lima s_+o Fae = 0. Below we show how to choose a 
plane for which m = 2 and prove its uniqueness (essentially using the regularity of 
the curve). 
Let n be the unit normal vector to the chosen plane. Clearly, 


y" (so) 


k 
y (so + As) — y(so) = y'(so)As + a -» = Asey+ 5 (As)e2 + wees 
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Thus, A(As) is the absolute value of the dot product of the vectors n and 
Ase, + 5(As)*e9 + .... Therefore, to minimize h(As), we must choose n to be 
perpendicular to the vectors e; and e9, i.e., the required plane must be parallel to e, 
and e2. This uniquely determines a plane passing through the point y (sq). 

Now it is already easy to check that lima; ee = = 0 for the chosen plane. 
Indeed, h(As) = €1(As)* and d?(As) = (As)? + €2, where ¢; > 0 and ¢2 > 0 
as As > 0. 

Given a regular curve y(s), the plane passing through a point y (sg) and parallel 
to the first two vectors e; and e2 of the Frenet—Serret frame at this point is called an 
osculating plane. 

An osculating plane can also be found as follows. The plane passing through 
a point y (sg) and perpendicular to a vector n is determined by the equation (x — 
y (so), 1) = O. Let us substitute x = y(s) into the left-hand side and consider the 
function f(s) = (y(s) — y (so), n) thus obtained. It is required to choose the vector 
n So as to maximize the number of derivatives of f(s) vanishing at so. The equation 
f'(so) = 0 means that the vector n is perpendicular to e;, and f”(so) = 0 means 
that n is perpendicular to e2. 

In a similar way, given a curve y(s), we can construct the osculating sphere at 
a point y(so). But for this purpose it is required not only that the curve be regular 
(i.e., the curvature k be nonvanishing) but also that the torsion x be nonvanishing as 
well. The sphere centered at a and passing through a point y (sg) is determined by 
the equation ||x — al? — R? = 0, where R = ||y (so) — al]. We want to choose the 
center a of this sphere so as to maximize the number of derivatives of the function 
f(s) = lly (s)—a|l? vanishing at so (we throw away the constant, because it vanishes 
under differentiation). Clearly, 


f = 2(y —a, y’') = 2(y —a, ei), 
f" =AWy -a,y") + 2lly'W? = Wy — a, ker) +2. 


Then, taking into account the relations e; Le. and (ke2)’ = k’eo + k(e2)’ = k’e2 — 
ke, + kxe3, we obtain 


fl" =Uy - a4, Ken — ke, + kxe3). 


Let a = y(so) + per + geo + re3. The condition f’(so) = 0 means that p = 0, 
and the condition f” (so) = 0 means that —kq + 1 = 0, ie., g = 1/k. Finally, for 
a= rer + re3, the condition f’” (sg) = 0 means that Ke +rkx = 0. As aresult, we 
obtain that the required sphere is centered at 


(s y+ ! e2(s y- #0) 
aa ame C0 eee aT CCN) 


€3(So), 


and passes through the point y (so). It is called an osculating sphere. 
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2.3. Total Curvature of a Closed Curve 


Let y(s) be a smooth closed curve in R* with natural parameterization. We will 
assume that s € [0,/], where / is the length of y. The numbers Jo k(s) ds and 


fo x(s) ds are called, respectively, the total curvature and the total torsion of the 
curve y. 

To a curve y we can assign its velocity indicatrix by taking the velocity vector 
ta at each point of the curve and treating this vector as a point of the unit sphere. 
Thus, the velocity indicatrix is a closed curve on the unit sphere. The total curvature 
of a curve y equals the length of its velocity indicatrix, because k(s) ds = ||dv||. 


Theorem 2.1 (Fenchel [Fel]) The total curvature of a closed curve y is at 
least 27. 


First Proof First, we prove the following auxiliary assertion. 


Lemma /f any plane through the center of a unit sphere intersects a curve! w lying 
on this sphere in at least n points, then the length of w is at least nz. 


Proof To each point of w we assign the set of all planes which pass through this 
point and the center of the sphere, and to every such plane we assign the pair of 
points in which the sphere intersects the normal to the plane through the center of 
the sphere. As a result, an infinitesimal arc da will be assigned a spherical lune of 
angle da (that is, the part of the sphere captured between two planes which make 
an angle of da). Its area equals 4dq (the area of a spherical lune of angle z equals 
that of the entire sphere, i.e., 47r). Thus, the area of the figure assigned to a curve of 
length L equals 4Z, and the area of a domain covered m-fold is counted m times. By 
assumption each point of the sphere is covered at least n-fold. Therefore, 4L > 47n, 
Le., L > an. oO 


Now let us prove that any plane through the center of the sphere intersects the 
velocity indicatrix in at least two points (or in one double point, but the set of such 
points has measure zero). Indeed, the curve y can be enclosed by two planes IT; and 
TI parallel to the given plane in such a way that the planes IT; and I> are tangent 
to y at points y(s) and y(s2). Then the velocity vectors at these points are parallel 
to the given plane. This means that the velocity indicatrix intersects the given plane 
ins = sj ands = 5S. 

Applying the lemma with n = 2, we see that the length of the indicatrix is at 
least 277. Oo 


Second Proof Fix a positive integer n and consider the spatial n-gon with vertices 
A, = y(l/n), Az = y(2l/n), ..., An = y(l) = y(O). Let B} @ = 1,...,7) be 
the points in which the unit sphere intersects the translate of the ray Aj Aj+1 with 
origin at the center of the sphere. If n is large enough, then the points B; and Bj+1 


' This curve is not required to be closed or connected. 
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are not diametrically opposite, so that the spherical polygon B, ... By, is uniquely 
determined. For large n, this polygon is little different from the velocity indicatrix of 
y. Therefore, it suffices to prove that the sum of the lengths of sides of the polygon 
B,... By, is at least 277. The sum of the lengths of sides of the polygon B, ... By 
equals the sum of the exterior angles of the polygon A; ... Ay. Let g; be the angle 
of A; ... A, at the vertex A;. We are interested in the sum ) (7 —g;) = na —)° gj. 

Let us join the vertex A, with the vertices Az, A3, ..., An—2 by straight line 
segments. We obtain n — 2 triangles. The angles gy; and g,_1 are angles of these 
triangles, and each of the remaining angles but g, does not exceed the sum of two 
angles of these triangles (the angle g, does not exceed the sum of several ones). 
Therefore, the sum of g; does not exceed the sum of the angles of n — 2 triangles, 
Le., >> g; < (n — 2). Thus, )o(a — gj) = nx — >> y > 27, as required. oO 


Remark The total curvature of a closed curve y equals 27 only if y is a plane 
convex curve. 


HISTORICAL COMMENT Werner Fenchel (1905-1988) proved the theorem about 
the total curvature of a closed curve in 1929. 


Problem 2.13 


(a) Prove that if the length of a (connected) closed curve w on the unit sphere is less 
than 25 < 27, then the curve lies in a spherical disk? of spherical diameter 6. 
(b) Find a proof of Fenchel’s theorem which uses assertion (a). 


HISTORICAL COMMENT In his 1948 paper [Bo] Karol Borsuk (1905-1982) con- 
jectured that the total curvature of a knotted curve y is at least 47r. In 1949 this 
conjecture was proved independently first by Istvan Fary (1922-1984) [Fa2] and 
then by John Milnor (born 1931) [Mi2]. 


Theorem 2.2 (Fary—Milnor) The total curvature of a knotted curve in R°? is at 
least 47. 


Proof It suffices to prove that a plane in general position passing through the center 
of the sphere intersects the velocity indicatrix of a knotted curve in at least four 
points. (After that, the lemma in the first proof of Fenchel’s theorem can be applied.) 
A plane in general position intersects a closed curve in an even number of points; 
therefore, it suffices to prove that the number of intersection points is greater than 
two. Suppose that a curve y is enclosed by two parallel planes IT; and IT2 and these 
planes II; and Iz are tangent to y at points y(s1) and y(s2). Suppose also that 
no other plane parallel to IT; and Iz is tangent to y. Then the points y(s,) and 
y (sz) divide the curve y into two arcs, and as a point moves on any of these arcs, its 
distance to II; changes monotonically. This means that the curve y is unknotted. 

oO 


2 A spherical disk of diameter 4 is the set of points of a sphere whose spherical distance to a certain 
fixed point of the sphere (the center of the disk) is at most 5/2. 
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Problem 2.14 Prove that if the maximum curvature of a closed space curve equals 
k, then the length / of this curve satisfies the inequality 1 > 27/k. 


2.4 Bertrand Curves 


Curves y(s) and y*(s) are called Bertrand curves if, for each s, the principal normal 
to the curve y at the point y(s) coincides with the principal normal to y* at y*(s). 
We assume that s is the natural parameter for the curve y; for y*, this parameter is 
already not necessarily natural. 

It is seen directly from the definition that y*(s) = y(s) + ae2. Therefore, 


dy* dy ; dep 
i ae ae 


= (1 —ak)e, +a'e. + axe3. 


The tangent vector ay” must be orthogonal to the principal normal vector e2, whence 
a’ = 0, i.e., the coefficient a is constant. This means that the distance between the 
corresponding points of the Bertrand curves is constant. 

Let s* be the natural parameter for the curve y*, and let er. 5, e3 be the Frenet— 
Serret frame for this curve. Then the vector eF is orthogonal to 5 = +6), Le., 
e} = cos@e, + sinwe3. Hence 


dcos@ d sing 
e+ 


ds ds 


= en(kcosw — x sinw) + 3. 


By condition, the normal vectors 5 and é2 are collinear; therefore, the angle m must 


5 
be constant. Moreover, the vectors ay" = (1 —ak)e; + axe3 and ej; = coswe; + 
; : ama . |l—ak ax ; 
sin w e3 are collinear, which implies . = 0, ie., 
cos@ sinw 
ak sinw + ax cosw = sinw. (2.2) 


If sin@ = 0 anda ¥ O, then x = 0, which corresponds to plane curves. But if 
sin w 4 0, then, setting b = acotw, we can rewrite Eq. (2.2) in the form 


ak + bx =1. (2.3) 


It is possible to reverse this argument and show that condition (2.3) is not only 
necessary but also sufficient for the curves y and y* to be Bertrand curves. 
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HISTORICAL COMMENT Bertrand curves were studied by Joseph Bertrand (1822- 
1900) in 1850. 


2.5 The Frenet—Serret Formulas in Many-Dimensional Space 


We say that a curve y(s) = (x1(s),...,X,(S)) in n-space is regular if the vectors 
y'(s), y(s), ..., y“~(s) are linearly independent at each point s of this curve. 
At every point of a regular curve the Frenet—Serret frame e(s), ..., @n(s) can be 
defined; it is constructed as follows. The vectors e)(s), ..., @n—1(s) are obtained 
by applying the Gram—Schmidt orthonormalization process to the vectors y’(s), 
y"'(s), .0e, y"-Y(s), and the vector e,(s) completes these vectors to a positively 
oriented orthonormal basis. Recall that the Gram—Schmidt orthonormalization 
process generates the vectors e;, ..., €,—1 by induction; to obtain a vector eg, we 
take the vector Aye; +--+ +Ag—1eg-1 + rae where A; = —(e;, y™), and divide it 
by its length. Such a choice of the coefficients A; ensures the orthogonality of e; to 
the vectors e1,..., €g—1. 


The vector e; is a linear combination of y’, y”, ..., y”; therefore, the vector e; 
is a linear combination of y”, y’”, ..., y“+) and hence of e1, €2, ..., €/41- 
Consider the numbers x; = (e,, ei41) fori = 1,...,n —1.Ifi < n —2, then 


the number (e;, €j+1) is of the same sign as (yet), €i+1) > 0, so that %; is positive 
(the number x,—1 can have any sign and even be vanishing). 


Theorem 2.3 The derivatives of the vectors the Frenet-Serret basis vectors are 
expressed in terms of these vectors themselves as follows: 


O x O. 0 0 
el ef x, O m. 0 0 
: | =HkK , where K = Dia Oe 0 7 
en en 0 0 0 Os 2ec4 

0 0 0... —2%,-) 0 


Proof Let e; = kjje +--+ +-+kinen. Then (e;, e;) = kj. From the identity (e;, e;) = 
6;; it follows that (e!, e;) + (e, a) = 0, ie., kjj + kj; = 0. Thus, the matrix K 


is skew-symmetric. The vector e, is a linear combination of e1, ..., e;41; therefore, 
k;j =O for j > i+ 2. Since the matrix K is skew-symmetric, it follows that k;; = 0 
fori > j +2. Oo 


Corollary A curve lies entirely in a hyperplane if and only if x, = 0 at all points 
of this curve. 


Proof The identity x,—1 = 0 is equivalent to the constancy of the vector ey. If en is 
constant, then the vectors e1, ..., @,—1 lie in the hyperplane perpendicular to e,. For 
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our purposes, it is only essential that the velocity vector e; of the curve y(s) lies in 
this hyperplane. Indeed, this implies that y lies in the hyperplane perpendicular to 
the vector e, and passes through the initial point y (so). 


Now suppose that a curve y lies in a hyperplane. Then all vectors e1, ..., @n—1 
are parallel to this hyperplane. Therefore, the vector e, is perpendicular to it, i.e., 
constant. Oo 


The quantity ,—1 is called the torsion of the curve y in n-space. 


Theorem 2.4 For any positive functions (8), ..., %,—2(s) and any nonvanishing 
function x,—-\(s), there exists a unique (up to a motion) curve with natural 
parameterization in n-dimensional space for which the given functions are precisely 
the quantities through which the derivatives of the vectors in the Frenet-Serret basis 
are expressed. 


Proof Let e;(s), ..., €n(s) be the Frenet—Serret frame. Consider the matrix F(s) = 
(e1(5),..-, €n(S)) T The Frenet—Serret equations can be written in the form F’(s) = 
K(s)F(s), where K (s) is the matrix in the statement of Theorem 2.3. 

Now let us construct the matrix K (s) from the given functions x (s), ..., %,—2(s) 
and find a solution of the matrix differential equation F’(s) = K(s)F(s) with an 
arbitrary initial orthogonal matrix F'(sg). Let us show that all matrices F(s) are then 
orthogonal as well. We have (FF?) = F’F! + F(F’)’ = K(FF')+(FF‘)K’. 
Let X = FF’. The matrix differential equation X’ = KX + XK’ with initial 
condition X = I has a unique solution. Therefore, X = / is the solution of this 
equation, because K + K" = 0. It follows from the uniqueness of the solution that 
F(s)F(s)? = I forall s, i.e., all matrices F(s) are orthogonal. 

The matrix F(s) determines, in particular, the unit vector e;(s). Let y(so) = go 
be any initial point. Consider the curve y(s) = go + ie e1(t)dt. We have y’ = e). 
Therefore, it follows from F’ = KF that y” = e = x1e2 and y” = (x e2)' = 
weg + xe, = xe2 + #1 (—x1e1 + 4263) = (—x?2e1 + 212) + %122e3. Similar 
considerations show that the vector y“ equals the sum of some linear combination 
of the vectors e1, ..., ex—1 and the vector 2] 22 . . . *x-1ex. Note that we always have 
2122 ...#k—-1 A O, and hence the vector ex is determined uniquely. Therefore, e1, 
..., €n is the Serret-Frenet basis, and the given functions are precisely the quantities 
through which the derivatives of the Frenet—Serret basis vectors are expressed. O 


HISTORICAL COMMENT An analogue of the Frenet—Serret formulas for a curve in 
n-space was derived by Camille Jordan (1838-1922) in 1874. 
2.6 Solutions of Problems 


2.1 A normal plane is orthogonal to the vector e;, which means that (xo - 
y(s), e1) = 0. Therefore, 4 \Ixo — y(s)|?7 = —2(x0 —y(s), e1) = 0. 
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= JR2+h2, whence t = S___. where s 
AJ R2+h2 


Then ej = w(—Rsinws, Rcosws, h) 


’ d 
2.2 It is easy to check that | + 


1 


VJ Rh 


is the natural parameter. Let w = 


and e} = Ro*(— cos ws, — sinws,0). Therefore, k = Ra? = ao and ey = 

(— cosas, — sinws, 0). Extending the vectors e; and e2 to a positively oriented 

orthonormal basis, we obtain e3 = (hwsinws, —hwcosws, Rw). To find x, we 

write the equation es = —ke, + xe3 for the last coordinate: 0 = —kwh+ x Ra, ie., 
=_= REE 

2.3 


(a) Let y(s) be a curve for which k = 0, i.e., 1 = = 0. Then 4 + = e, = const. 
Therefore, y(s) = y(O) + se, ie., the a y lies on the line through y (0) 
parallel to the vector e;. 

(b) Let y(s) be a curve for which x = 0. Then e3 = ponst. The vector e3 is 
orthogonal to e; = - Therefore, 4 (63, y) = (63, a) = = 0, Le., (63, y) = 


const. This means that the curve y lies in a plane orthogonal to the vector e3. 


2.4 In the same way as in the solution of Problem 1.3 we obtain 


Py ay"? 


-_ (y’?)3 
Moreover, for any two vectors a, b € IR}, we have a*b* — (a, b)* = (ax b). 
2.5 


(a) The equation e, = —xe2 shows that |x|? = (eo, ee Moreover, e2 = iy’ ’ and 
B= ty’ x y”. Hence 


1\’ 1 1 
aes ies / " Salt " et m 
b= (2) yxy + EY xy ae ae 2 


In this expression for e% the first summand is orthogonal e2 and the second is 
zero. Thus, ||? = ay”, yxy”). 
(b) Let us express the derivatives of y with respect to s in terms of the derivatives 


of y with respect to f: 


dy _ dy dt 
ds dt ‘ds’ 
dy dy (dt\? dy at 
= (2) dt ds2’ 


dy dy (dt on 
ds dt? \ds bate 
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where the, dots in the last formula denote a linear combination of the vectors 
a and oy Using the formula for the curvature of a curve with an arbitrary 


paraiictenantion given in Problem 2.4, we obtain 


6 
(Vy Ves Vs Cis r ").(4) 


x= to = 4 ee 04.42%; ds 


k? (x ¥%') 


We also have (y’?)3 — (4s)°, because 


dy |" _ dy |? ds\? _ ds\ 

dt|| || ds dt) ~\dt} © 
2.6 By condition ||y(s) — xo||?7 = R? = const. Differentiating this equation, we 
obtain 2(y(s) — xo, e1) = 0. Therefore, 


d d 
0= . (y(s) — xo, e1) = (e1, e1) + (V(S) — x0, —e1) 
Ss ds 


= 1+ (y(s) — x0, kez). 


Thus, k(y(s) — xo, e2) = —1. The length of the vector y (s) — x9 equals R, whence 
1=k\|(y(s) — Xo, e2)| < KR, which implies the required inequality k > 1 : R. 

Let y(s) — x9 = Ae, + wer + ve3. Then dA = (y(s) — x0,e1) = 0, w = 
(y(s) — xo, e2) = —1/k = —p, and v = (y(s) — x0, e3). Hence 


d 
-p' = a, 0) — Xo, €2) = (€1, €2) + (v(s) — x0, —ke1 + e3) 


= 0—k(y(s) — x0, e1) + *(y (5) — x0, 63) 
= 0+ x(y(s) — x0, €3). 
Therefore, v = (y(s) — x0, e3) = —p'/x = —p’o. 


Since the vectors e2 and e3 are mutually orthogonal and have unit length, it 
follows that ||v(s) — xoll? = || — pe2 — p’oes||* = p* + (p'o)?. 
2.7 First, suppose that the curve y (s) lies entirely on a sphere centered at xo. Then, 


according to Problem 2.6, we have x9 = y(s) + pe2 + p’ce3. Therefore, 


dx de de 
0=— =e + per + p—= +t (ploy'e3 + plo— 
ds ds ds 


=e, + p'e.+ p(—ke, + xe3) + (p'c)'e3 — p'oxe2 


=(£+('oy)es 


Thus, £+ # (0) =0. 
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Now suppose that o + £ (c #) = 0. Consider the point xo(s) = y(s) + pe2 + 
p'ae3. The calculations performed above show that ao = (£4+(p'o) jes = 0, 


i.e., the point x9(s) = xo is fixed. Moreover, 


d 2_ 4d 2 FaN2\ ! p ies ah eee 
= xo ~ v(s)I? = (0? + (p'0)*) = 2p'o (£ + (p'0') =0. 


This means that all points of the curve y(s) are at the same distance from xo. 


2.8 Let y(s) be a curve with natural parameterization on a sphere centered at 


xo. We can assume that x9 = 0. In the solution of Problem 2.6 we obtained the 


equations (y, €2) = -; and -£ (+) = x(y, e3). In the case where the curvature k 


is constant, the second equation takes the form x(y, e3) = 0. It follows that x = 0 
at all points of the curve. Indeed, suppose that x 4 0 at some point. Then x 4 0 
and hence (e3, y) = 0 in some neighborhood of this point; differentiating the last 
equation, we obtain 


O = (—xe2, v) + (€3, €1) = —2(E2, y). 


It follows from (y, e2) = -t that (e2, y) 4 0, whence x = 0. 
According to Problem 2.3 (b), a curve with identically vanishing torsion lies in 
some plane. 


2.9 First, suppose that the curve y lies on a sphere centered at x9. We can assume 
that x9 = 0. In the solution of Problem 2.6 we obtained the equations (y (s), €2) = 


-; and -£ (t) = x(y(s), e3). Let us introduce the function f(s) = —(y(s), e3). 


The second equation is written in its terms as fx = £ (t). We have 


d d d 
a —— Gar) _ (rw, =) = (y(s), x2) = -5; 


in deriving the last equality, we have used the relation (y (s), €2) a -; 
Now suppose that, for the curve y, there exists a function f for which the above 
equations hold. We set x(s) = y(s) + Ler + fe3. Then 


dx dy d (z) 1 de2 df de3 
—_>_> = e 
k 


ae egg cd 


d {1 1 d 
=e,t+ as (z) eg + rea + e3) + oes — fxeg=0. 
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Therefore, x(s) = xo is a fixed point. Next, 


d 2 d 1 2) _ ld ’ ail = 
qs lv (9) — oll -i(at+r)=2(75 Gg) +a) =o 


and hence the curve y(s) lies on the sphere of radius a + f? centered at xo. 


2.10 First, suppose that the specified identity holds for some constants A and B. 
Then, clearly, k(s) 4 0. Let us write the identity in the form 


S Ss 
Acos [ rds + Bsin [ xds = —— (2.4) 
0 0 


and differentiate it with respect to s. As a result, we obtain 


Ss Ss d 1 
(-A sin | #xds+ Bos [ x ds) x(s) =—([-}]. 
0 0 ds \k 


Consider the function 


Ss 


Ss 
fo= —asin [ xds + Beos [ nds. 
0 


0 


Obviously, fx = 4 (7). It is also easy to check that af = — . Applying 


Problem 2.9, we obtain the required result. 
Now suppose that a smooth curve y(s) lies on a sphere. Then, according to 


Problem 2.9, there exists a function f for which fx = om (+) and af = —f. 
Consider the function 
AY 
6(s) = xds, 
0 
1 
g(s) = —~cosO(s) — f(s) sinO(s), (2.5) 
k(s) 
1 
h(s) = —~sin@(s) + f(s) cos 6(s). (2.6) 
k(s) 


It is easy to see that g’(s) = 0 and h'(s) = O. Thus, g(s) = A and h(s) = B, 
where A and B are constants. Relations (2.5) and (2.6) can be regarded as a system 
of equations for k and f. Solving this system, we obtain i = Acos@+B sin@. This 
coincides with the required identity (2.4). 

It remains to prove that if identity (2.4) holds, then the radius of the sphere 
containing the curve equals A? + B2. According to Problem 2.9, the squared 
radius of the sphere equals a + f?.In the case under consideration, i = Acos@+ 


Bsin@ and f = —Asin@ + Bcos@. Therefore, a +f? =A*+B?. 
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2.11 First suppose that y is a curve of constant slope. Then (e,e1) = cosa for 
some fixed angle a. If cosa = +1, then e} = +e. Hence y is a straight line, and 
k = 0, which contradicts the assumption. Therefore, cosa 4 +1. We have 


d d 
0 = —(e, e1) = (e, —e}) oo (e, ke) = k(e, e€2), 
ds ds 
which implies e21e. Thus, e = (cosa)e; + (e3, e)e3. Changing the sign of @ if 
necessary, we obtain e = (cosa@)e; + (sina)e3. Therefore, 


0 = —e =cosa—e, + dupe. = (cosa)ke2 — (sina)xe. 
ds ds ds 
Thus, x = (cota)k. 

Now suppose that x = ck. If c = 0, then x = 0, and hence y is a plane curve. In 
this case, for e we can take the vector orthogonal to the plane of the curve. In what 
follows, we assume that c ~ 0. Choose an angle a with cota = c and consider the 
vector e = (cosa@)e; + (sina)e3. This vector has the required properties, because 
(e, ej) = cosa = const and fe = (cosa)ke2 — (sina)xe2 = 0. 


2.12 First, suppose given two smooth curves y (s) and y(s) with the same curvature 
k(s) and torsion x(s). Let e1, e2, e3 and @1, é2, é3 be their Frenet—Serret frames. 
Then 4 (e121 + e2€2 + €323) = 0, because 


e1é1 + e1e} = k(ere, + e1€2), 

62 + €2@) = —k(e1€2 + e2€1) + x (e362 + €263), 

e463 + €364 = —x(e2€3 + €3€2). 
We can assume that at s = 0 we have e1é; + e2€2 + €3€3 = 3, 1.e., e] = €1, €2 = €2 
and e3 = 3. Then these relations hold for all s. This proves the uniqueness of a 
curve y with given k(s) and x(s). 


To prove the existence of such a curve y, consider the system of differential 
equations 


=~ = kun, so = —ku, + xu3, <a = —xu2, 
uy Uj] U12 U3 
where u1(s), u2(s), u3(s) € R?. We set U = ur} = | ua u22 u23] and 
u3 U3] U32 U33 
0 k O 
A= |-k 0 x J. Then, we find a solution of the system 4u = AU with initial 
0 —x 0 


condition U(O) = J (the identity matrix) and put xi(s) = i, ujj(t) dt, so that 
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a” = uy; and x! (0) = 0. Let us check that the curve y(s) = (x!(s), x7(s), x3(s)) 
has given curvature k(s) and given torsion x(s). Since the matrix A is skew- 
symmetric, we have 4£(UTU) = (<u7)u + U7 (<u) =U'TATU+UTAU = 
U?(—A)U + U'AU = 0. For s = 0, the matrix U(s) is orthogonal, because 
U'U = 1. Therefore, for all s, we have U(s)’ U(s) = TI, ie., U(s) is orthogonal 
everywhere. Thus, the matrix U(s) determines the Frenet—Serret frame of the 
curve y. 


2.13 


(a) First, note that if the length of a connected closed curve on the unit sphere is 
less than 25 < 27, then this curve contains no diametrically opposite points 
of a spherical circle of diameter 6. Indeed, if a connected closed curve passes 
through two diametrically opposite points, then these points divide it into two 
arcs, each of length at least 6. 

Choose two points A and B on the curve w so that they divide w into two 

arcs of the same length. These points are not diametrically opposite points of the 
sphere; therefore, the shortest arc AB on the sphere is uniquely determined. Let 
M be the midpoint of this arc. We claim that the curve w lies on the spherical 
disk D of diameter 5 centered at M. To show this, first, note that the points A 
and B lie in the disk D, because they are joined by a curve of length 6 and 
hence the length of the shortest arc of the sphere joining them is at most 6. Now 
suppose that at least one of the two arcs into which the points A and B divide 
the curve w intersects the boundary of D. We denote this arc by w;. Consider 
the curve | symmetric to @; with respect to the straight line joining M with 
the center of the sphere. From the curves @; and w, we can compose a closed 
connected curve. On the one hand, the length of this curve equals that of a, 
i.e., is less than 25. On the other hand, the arc w; contains a point C of the 
boundary circle of D, and therefore the arc w{ contains the point of this circle 
diametrically opposite to C. We have arrived at a contradiction. 
It follows from assertion (a) that if the length of a connected closed curve on the 
unit sphere is less than 27r, then this curve lies on one side of a plane passing 
through the center of the sphere. Precisely the same argument as in the first 
proof of Fenchel’s theorem shows that any plane passing through the center 
of the sphere intersects the velocity indicatrix in at least two points. If follows 
that the velocity indicatrix cannot lie on one side of such a plane; therefore, the 
length of the velocity indicatrix is at least 277. 


(b 


ma 


2.14 According to Fenchel’s theorem, we have 27 < Ja k(s)ds. Using the 
Cauchy—Bunyakovsky—Schwarz integral inequality, we see that 


i 2 1 I 1 
(Qn)? < (/ k(s)ds) < | as- f Pods =1 f k*(s)ds < Pk, 
0 0 0 0 


whence / > 27/k. 


Chapter 3 ®) 
Surfaces in Space od 


One of the most widespread ways of specifying a surface in R? is the parametric 
one, when a surface is given by three functions x(u, v), y(u, v), and z(u, v). In 
other words, a domain in R? with coordinates (u, v) is mapped to R?. The induced 


map takes the basis vectors directed along the u- and v-axes to the vectors e, = 


(Xu, Yu, Zu) = (%. ae ie) and ey = (Xv, yy, Zv) tangent to the surface. We will 


largely consider smooth surfaces, for which the functions x, y, and z are smooth and 
the vectors e, and e, are linearly independent at each point of the surface. 

A particular case of the parametric specification of a surface is the graph of a 
function f(x, y); in this case, x = u, y = v, andz = f(u,v) = f(x,y). A 
parametrically specified smooth surface can be locally represented as the graph of 
some function by drawing the normal line and tangent planes at a given point of the 
surface and taking coordinates in the tangent plane as x and y and a coordinate on 
the normal line as z. 

For a surface which is the graph of a function f(x, y), tangent vectors are 
(1,0, fy) and (0, 1, fy). 

Surfaces do not always admit a global parameterization; in what follows, 
we will consider surfaces whose different regions are endowed with different 
parameterizations. Compact surfaces (without boundary) are called closed surfaces. 
In this chapter we consider surfaces in R?: this means that, in addition to a surface 
as a two-dimensional manifold, a map of this manifold to R? is given. If this map is 
a homeomorphism onto its image, then we say that the surface is embedded. If the 
map not subject to any restrictions (except the requirement that the surface must be 
smooth, i.e., the vectors e,, and e, must be linearly independent at each point of the 
surface), then we say that the surface is immersed. 

With obvious rare exceptions (when the boundary of a surface is mentioned 
explicitly), we consider surfaces without boundary. 
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66 3 Surfaces in Space 
3.1 The First Quadratic Form 


The first quadratic form of a surface in R? is defined on tangent spaces at the points 
of this surface; to each tangent vector it assigns its inner square (as a vector in R?). 


HISTORICAL COMMENT The first quadratic form was introduced in 1822 by 
Carl Friedrich Gauss (1777-1855) in the solution of the problem of superposing 
(isometrically mapping) one surface onto another. 


Consider a curve y(t) = (x(u(t), v(t)), ....) on a parameterized surface. Clearly, 
dx dx du | dx dv 
dt du dt dv dt’ 


so that the velocity vector of this curve is y’ = u’e, + v’ey. The squared length 
of this vector equals Eu’? + 2Fu'v' + Gu’, where E = (ey, ey), F = (eu, ev) = 
(€y, €y), and G = (ey, é,). In this notation the first quadratic form is often written 
as E du* + 2Fdu dv + G dv’. The equation F = 0 means that the coordinate lines 
u = const and v = const are orthogonal. 

Using the first quadratic form, we can calculate the length of a curve and the area 
of a surface. The length of a curve can be expressed as the integral 


t 
/ VEU? + 2a + Gat = f VE dut + 2Fdudo + Gade. 
i) 


The area element equals the area of the parallelogram with sides e,du and e,duv, 
and the area of the parallelogram spanned by the vectors e, and e, equals the length 
of their cross product e, x ey. Therefore, the area of a surface can be expressed as 
the integral 


[ies X ey||du dv. 


Let us show that |le, x ey || = VEG — F2. To this end, we apply the formula 
(a x b,c x d) = (a, c)(b, d) — (a, d)(b, c) 
to lle, x ey ||? = (€y X Cy, Cy X Cy): 
lleu x evll” = (€u x ev, Cu X &v) = (Cus u)(Cv, Cv) — (Cu ev)” = EG — F°. 


Thus, the area of the surface can be expressed as the integral 


[ VEG=Pauav. (3.1) 
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In differential geometry, even in the two-dimensional case, some formulas are 
fairly cumbersome, and it is more convenient to use index notation. For this reason, 
in addition to the notation E du? + 2Fdu dv + G dv”, which is convenient in many 
situations, the notation g;; for the coefficients of the first quadratic form is also 
used: this is the inner product of the tangent vectors e; and e; to the surface that 
correspond to the basis vectors in the plane R?. It is convenient to specify the first 
quadratic form by the coefficient matrix (g;;). For example, if the surface under 
consideration is the graph of a function f(x, y), then e; = (1,0, f,) and eg = 


14+ f2 fiefs 
Ihr iff.) 


(0, 1, fy), and hence the matrix of the first quadratic form is 


this notation formula (3.1) takes the form 


/ [1+ f2+ ffdx dy. 


Example 3.1 For the hemisphere z = \/1 — x? — y?, where x*-+y” < 1, the matrix 
of the first quadratic form equals 


1 ly? xy 
1—x2-y? mg Tox) 
Proof We have 


a¥ = 
e; = | 1,0, —— and e2 = {0, 1, ————. ] . 
1—x2-y? 1—x2—y2 


Therefore, e.g., 


’ =1 ae, Tee, 
(e1,e1) ae 


Oo 


The first quadratic form on a surface determines an inner product (and, thereby, 
a metric) in the tangent space to the surface. For this reason, it is also called a 
Riemannian metric or a metric tensor. In differential geometry a need for inner 
product not only in the tangent space but also in the dual (cotangent) space arises. 
There is a standard construction of inner product in the dual space, which we 
will now describe. Here and in the sequel, we use the following convention: when 
in a sum >); a‘b; the index of summation occurs twice, as a subscript and as a 
superscript, the summation sign is omitted, i.e., the sum is written in the form a');. 

If a vector space V is equipped with an inner product (v, w), then V can be 
identified with the dual space V* by associating each vector v with the covector v* 
such that v*(w) = (v, w) forall w € V. After that, an inner product in V™* is defined 
by (v*, w*) = (v, w). 
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Given a pasts €1,.--, €n ina space V, the corresponding basis in the dual space 
V* is el, . K where (ej) = 5 (here j= = 6;; is the Kronecker delta; we will 
write it in ae tensor form, with sub- and superscripts). Let us show that if the inner 
product in V is defined by a matrix G = (g;;) in the basis e1, ..., én, then the inner 
product in the dual space is defined by the matrix G~! = (g'/) in the dual basis 


e!,..., e”. First, note that UF = gij v'. Indeed, on the one hand, v*(w) = (v, w) = 


gijv'w!, and on the other hand, v*(w) = vy ‘(wle; j) =v; twit = view) We 
will represent the vector v as a row of coordinates and the covector’ v* as a column 
of coordinates. Then v* = v!G, (v,w) = v! Gw, and (v*, w*) = v* X(w*)!, 
where X is the required matrix. Since v* X (w*)’ = v'’GXG" y, it follows that 
GXG' =G, and hence x= = (G")~! = G“!, because the matrix G is symmetric. 

The formula v* =a v! and the a formula v/ = g!/ vu; are the coordinate 
representations of the correspondence between vectors and covectors when the 
given vector space or its dual space is equipped with inner product. 


Example 3.2 For the hemisphere in Example 3.1, the matrix (g’/) equals 
[=4° —xy 
ee a 
Problem 3.1 Calculate the matrices (g;;) and (g‘/) for polar coordinates (r, g) in 
the plane. 


Problem 3.2 Calculate the matrices (g;;) and (gi/) for the surface of revolution 


(f(u) cos v, f (u) sin v, u). 


3.2. The Darboux Frame of a Curve on a Surface 


Suppose that a smooth surface § C R? is oriented, i.e., at each point of S a unit 
normal vector is given. If a curve y(s) lies on the surface S, then, in addition to the 
Frenet—Serret frame, each point of the curve can be assigned the Darboux frame &1, 
€2, €3. We assume that the parameterization of the curve is natural. Then ¢) is, just as 
in the case of the Frenet—Serret frame, the tangent vector to the curve (the velocity 
vector), €3 is the normal vector to the surface S, and €2 is the uniquely determined 
vector for which €1, €2, €3 is an orthonormal positively oriented frame. 


HISTORICAL COMMENT The Darboux frame was introduced by Jean Gaston 
Darboux (1842-1917) in his four-volume 1887-1896 lectures on the general theory 
of surfaces. 


The motion of the Darboux frame is described by the equations 


€) = ae. + bes, 
/ 
Ey = —a&| + CE3, 


€3 = —be| — cep. 
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The coefficients a, b, and c bear the following names: a = kg is the geodesic 
curvature, b = ky is the normal curvature, and c = %g is the geodesic torsion. 
HISTORICAL COMMENT In 1830 Ferdinand Gotlibovich Minding (1806-1885) 
proved that the quantity = a , where R is the curvature of a curve on a surface 
and ¢ is the angle formed by the osculating plane of the curve and the tangent plane 
of the surface, belongs to the intrinsic geometry of the surface. Later, in 1848, Pierre 
Ossian Bonnet (1819-1892) called it geodesic curvature. 


The vector ¢/ is the acceleration vector of a curve with natural parameterization 
(the derivative of the velocity vector). Therefore, the normal curvature is the normal 
(with respect to the surface) component of the acceleration vector, and the geodesic 
curvature is the component of the acceleration vector that lies the tangent plane. 

If the normal vector and the curve are on the same side of the tangent plane in 
a neighborhood of some point, then the normal curvature of the curve is positive at 
this point, and if the normal vector and the curve lie on different sides, then it is 
negative. 

Under the reversal of curve orientation k, and 2, remain intact, while kg changes 
sign. Indeed, on the change of the curve orientation, the vector ¢3 remains the same, 
while ds, €;, and €2 change sign. 


Problem 3.3 Prove that the normal curvature k, and the geodesic torsion %, are 
completely determined by the surface S and the direction of the tangent to the curve. 


Problem 3.3 shows that the notion of normal curvature can be defined not only 
for curves on a surface but also for the surface itself. The normal curvature of a 
surface at a point p in the direction of a tangent vector v is the curvature of the 
intersection curve of the surface with the plane through p containing the vector v 
and the normal 7 to the surface. The acceleration at p of the curve thus obtained is 
directed along the normal to the surface; therefore, its normal curvature is equal to 
its curvature. Thus, the normal curvature of the surface at p in the direction v is the 
normal curvature k, of any curve on this surface whose tangent vector at p is v. 


Problem 3.4 Let e1, e2, e3 be the Frenet—Serret frame of a curve y(s) on a surface 
S, and let €;, €2, €3 be the Darboux frame. Prove that 


do 
kg =ksinO, k,=kcos@, and a ae 
Ss 


where @ is the angle between the vectors e2 and ¢3 (that is, between the principal 
normal to the curve and the normal to the surface). 

Problem 3.5 Prove that kj; + kz =k’. 

Problem 3.6 


(a) Consider the family of planes passing through a given point of a surface and 
containing a given tangent vector. Let k be the curvature of the intersection curve 
of the surface and a normal plane (i.e., a plane containing a normal vector), and 
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let k(@) be the curvature of the intersection curve of the surface and the plane 
making an angle @ with the normal plane. Prove that k = k(@) cos@. 

(b) Prove that the osculating circles of the curves in the family obtained above fill 
a sphere. 


HISTORICAL COMMENT Problems 3.3-3.6 are related to Meusnier’s theorem. In 
1776 Jean-Baptiste Meusnier (1754-1893) announced the theorem that all curves 
with common tangent on a surface have the same normal curvature and the 
osculating circles of these curves fill a sphere (it is known as Meusnier’s theorem). 
The equation k, = kcos@ in Problem 3.4 is called Meusnier’s formula; a different 
interpretation of Meusnier’s formula is given in Problem 3.6 (a). Another proof of 
Meusnier’s formula is given on p. 73. 


and 


>l- 


Problem 3.7 Prove that if a curve lies on a sphere of radius R, then k, = + 
%g = 0. 


Problem 3.8 Find the geodesic curvature of a circle of radius r on a sphere of 
radius R. 


3.3 Geodesics 


A curve y(s) with natural parameterization on a smooth surface is said to be 
geodesic if its geodesic curvature kg identically vanishes. A curve with natural 
parameterization is geodesic if and only if its acceleration vector is orthogonal to 
the surface at each point. A curve with an arbitrary parameterization is said to be 
geodesic if its acceleration vector is orthogonal to the surface at each point. The 
geodesics obtained from a geodesic y(s) by changing the parameterization have the 
form y (at + b), where a and b are constants, a 4 0. Indeed, we have 


dy dy (ds\? dy ds 
dt? ds? \dt ds dt?’ 


2y ; sige 
and hence the vector me is orthogonal to the surface if and only if rs = 0. 


Example 3.3 The curve y(s) = (coss, sins, 0) on the sphere x24 y? +22 = lis 
geodesic. 


Proof The velocity vector ¢) of the curve y equals (— sins, cos s, 0), whence ¢) = 
(—coss, — sins, 0) = —y(s). This vector is orthogonal to the surface of the sphere 
at the point y(s). 

Another proof uses Problem 3.8: the geodesic curvature of a circle of radius R 
on a sphere of radius R equals zero. Oo 


A shortest curve joining two points is geodesic (see Theorem 3.1 on p. 71). It is 
easy to see from Example 3.3 that the converse is false: the curve in that example 
may wind around itself, making arbitrarily many coils. However, the converse is 
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false only globally. Locally it is true: any sufficiently short part of a geodesic is a 
shortest curve joining given points (see Theorem 3.19 on p. 104). 


Problem 3.9 Prove that if a surface contains a straight line, then this line (endowed 
with the natural parameter s) is geodesic. 


Problem 3.10 Suppose given a surface of revolution (i.e., a surface swept out by a 
plane curve rotating about an axis lying in the plane of the curve). 


(a) Prove that a meridian (i.e., one of the two symmetric curves in the intersection 
of the surface and a plane containing the axis of rotation) is a geodesic (the 
parameterization of the curve is assumed to be natural). 

(b) Prove that a parallel (i.e., the circle traced by a point A of the rotating curve) is 
geodesic if and only if the tangent to the meridian at A is parallel to the axis of 
rotation (the parameterization of the curve is assumed to be natural). 


Theorem 3.1 /f a curve y(s) with natural parameterization is shortest among all 
curves joining points A and B ona given surface, then this curve is geodesic. 


Proof Suppose that k,(so) 4 0 for some so. Then kg(s) does not vanish on some 
segment [a, b]. It suffices to consider the case where A = y(a) and B = y(b). 
Indeed, any segment of a shortest curve must be shortest among all curves joining 
its endpoints. 

Let y(s) = r(u'(s),u(s)), where r(u!, u*) is the parameterization of the 
surface, and let (v!, v*) be the coordinates of the vector ¢2 with respect to this 


parameterization, i.e., ¢2 = v! on + vey or . Consider the family of curves 


vi(s) =r(ul(s) + ta(s)v', u2(s) + tA(s)v”), 


where A(s) is a smooth function vanishing at the endpoints of the segment [a, b] 
and positive at the other points of this segment. For each ft, the curve y;(s) joins the 
points A and B, and at t = 0 we obtain the initial (shortest) curve. Therefore, if L(t) 
is the length of 7;, then L’(0) = 0. 


> 1/dR OR 
Let R(s,t) = y;(s). Then L(t) = / —,— ]ds; therefore, 
a ds os 


(<5 aR 
V@O= es (=. =) (#4, (ei) 
7 [( aR aR) 
For t = 0, we have 2% = a and hence (2%, 2%) = | a || = 1. Thus, L’(0) = 


ds. Taking into account the equality 


i: a2R *) 

a \ds dt’ ds /| <0 
d (dR *\-(= *\+(2 7s) 
ds \ at’ as) \ datas’ as ar’ as2 )’ 
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we obtain 
vo= d (dR OR F [ dR JR 
“eds \ Ot! Os loa des KOE? Ost 


Next, we have af o= ; Ms)v! aa = A(s)é2 and A(a) = A(b) = O. Therefore, 


ot It= 
oR OR 
ig (=. ~*) 
t=0 ot Os 


le aR OR 
a ads \ at’ ds 


= kgé2 + kn€3. Thus, 
t=0 


ds. 
t=0 


b 
= 0. 


t=0la 


2 


0 
Finally, ae 
Ss 


b b 
L'(0) = -| (A(s)é2, kge2 + kye3)ds = -| A(s)kg(s) ds A 0. 


a 


We have arrived at a contradiction. oO 


3.4 The Second Quadratic Form 


Let S be a smooth oriented surface in R? specified parametrically, which means that 
the points of S are given in the form r(u, v). Consider a curve r(u(t), v(t)) on the 
surface S. To the tangent vector r’ the first quadratic form assigns its inner square 
(r’, r’) and the second quadratic form, the inner product (r’, —n’), where n is the 
unit normal vector to the surface. There is also yet another expression for the second 
quadratic form: since (r’, n) = 0 and, therefore, (r’, n’)+(r”, n) = 0, it follows that 
the second quadratic form assigns the inner product (r”, n) to each tangent vector r’. 

At first sight, it may seem that (r’, —n’) depends on uw’ and v’ linearly rather 
that quadratically. However, not only r’ but also n’ depends on uv’ and v’. From the 
second expression for the second quadratic form it is clearly seen that this form is 
quadratic. Indeed, we have r’ = ryu!+ryv! andr” = ryy(u’)* +2ryyu'v! tryy(v')?. 
Taking into account the independence of the normal vector n of u’ and v’, we see 
that (r”,) is a quadratic form in u’ and v’. The second quadratic form is often 
written as L du* + 2M dudv + Ndv*. Here L = —(ry,ny) = (run,n), 2M = 
—(Tu, Mv) — (Ty, Nu) = 2(ruv,n), and N = —(ry, Ny) = (ov, 2). 


Example 3.4 Ifa surface is given as the graph of a function z = f(x, y) such that 
7, 0) = 0 and (0, 0) = 0, then at the point x = y = 0 the second quadratic 
Fax Fey 


xy Jyy 
the Hessian, and its determinant is called the Hessian determinant. 


form is given by the matrix ( ) This matrix is called the Hessian matrix, or 
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Proof The given surface can be specified parametrically as r(x,y) = 
(x, y, f(x, y)). At the point under consideration we have n = (0, 0, 1) and 


r’ = (x", y" te (x’)? =p Rix y A fe (y’)?). 


Calculating the inner product of the vectors r” and n, we obtain the required result. 
| 


HISTORICAL COMMENT Otto Hesse (1811-1874) introduced Hessian in 1842 in 
studying curves of the third and the fourth degree. He considered equations in 
homogeneous coordinates, i.e., the curves were determined by functions of three 
variables and the Hessian was of order 3. 


Consider a curve r(s), where s is the natural parameter, on a surface S. The vector 
r” is directed along the principal normal e2 to the curve and equals the curvature k of 
the curve in magnitude. Therefore, (r”,n) = k cos 0, where 6 is the angle between 
the normal n = ¢3 to the surface and the principal normal e2 to the curve. If the 
tangent vector r’ is fixed, then (r’,n) = ko is the value of the second quadratic 
form at the vector r’ (which has unit length). Hence 


kcos@ = ko, 


where ko is the curvature of the intersection curve of the surface and the plane 
containing the vectors n and r’. 

We have obtained another proof of Meusnier’s formula, which we already 
discussed in Problems 3.4 and 3.6 (a). Recall that the number ko is called the 
normal curvature of the surface in the given direction r’. When considering vectors 
of arbitrary length, rather than only unit ones, we can represent ko as the fraction 
II/I, where II and I are the values of the second and the first quadratic form at the 
given vector r’. 

For any two real symmetric matrices A and B such that A is positive definite, 
there exists a matrix X for which X’ AX = / is the identity matrix and X’ BX = D 
is a diagonal matrix. The diagonal entries of D are determined by the condition 
det(B — dA) = 0. In the case of interest to us, A = (g;;) is the matrix of the first 
quadratic form and B is the matrix of the second quadratic form. The entries k; and 
k2 of the corresponding diagonal matrix are called the principal curvatures of the 
surface at the given point. The number H = 5 (ky +k?) is called the mean curvature, 
and K = kjkz is called the Gaussian curvature. 

The reversal of the surface orientation, i.e., the replacement of the vector n by 
—n, changes the sign of the principal curvature. 


HISTORICAL COMMENT Gauss introduced Gaussian curvature in 1827 as the 
ratio of the areas of infinitesimal figures on the sphere and on the given surface 
corresponding to each other under the spherical Gauss map. Before Gauss, the 
curvature of a surface was defined in precisely the same way in 1815 by Benjamin 
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Olinde Rodrigues (1795-1851). Rodrigues also proved that Gaussian curvature 
equals the product of principal curvatures. 


The expression for the Gaussian curvature of a surface of revolution is par- 
ticularly simple in the case where the parameterization of a meridian is natural. 
Therefore, in calculating the Gaussian curvature of a surface of revolution, we en- 
dow the surface with a parameterization of the form (f(u) cos v, f(u) sinv, g(u)). 
Then the parameterization of a meridian by u is natural if (f’)? + (g’)? = 1. 


Problem 3.11 Calculate the second quadratic form and the Gaussian curvature K 
of a surface of revolution (f(u) cos v, f(u) sin v, g(u)). Show that if the parameter- 


ization of a meridian is natural, then K = -£. 


Problem 3.12 A closed surface embedded in R? is contained in a ball of radius R. 
Prove that the Gaussian curvature of this surface is at least 1/R? at some point. 


For a basis {e;} with respect to which the matrix of a quadratic form q is diagonal, 
the equation g(>_ Aje;) = > ig (e;) holds. Therefore, if e; and e2 are unit tangent 
vectors corresponding to the principal curvatures k, and k2, then the unit vector 
€| COS g + e€2 sing corresponds to the normal curvature 


ky cos* g+k sin? g (Euler’s formula). 


HISTORICAL COMMENT In 1760 Leonhard Euler (1707-1783) obtained the fol- 
lowing expression for the curvature radius r: 


no 2f8 
f+s—(f -—g)cos29’ 


where f and g are the maximum and minimum curvature radii. The more convenient 
expression given above was derived by Charles Dupin (1784-1873) in 1813. 


Problem 3.13 Let us fix a point g in R? and consider the function F (p) = || p—q||7 
on a surface S. 


(a) Prove that a point p is critical if and only if the vector Pq is orthogonal to S 
at p. 

(b) Prove that a critical point p is degenerate if and only if the length of the vector 
Wa equals 1/|k;|, where k; is the nonzero principal curvature of the surface at 
p, and this vector has the corresponding direction. 


Problem 3.14 Suppose that a surface S contains entirely a straight line /. Prove that 
the Gaussian curvature of the surface is nonpositive at each point of this line. 


3.5 Gaussian Curvature 75 
3.5 Gaussian Curvature 


In Sect. 3.4 the Gaussian curvature K was defined as the product of principal 
curvatures, and the mean curvature H was defined as their half-sum. In this section 
we discuss Gaussian curvature in more detail. First, we obtain an explicit expression 
for Gaussian and mean curvature in terms of the coefficients of the first and second 
quadratic forms (see Theorem 3.2). Gaussian curvature differs from mean curvature 
in that it can be expressed in terms of only the first quadratic form (see Theorem 3.3). 
Thus, Gaussian curvature is an intrinsic invariant of a surface: it does not depend on 
the embedding of the surface in space and does not change under bendings, that is, 
transformations preserving the metric. 


Theorem 3.2 Let A and B be the matrices of the first and second quadratic forms. 
Then 


detB LN —M?2 aoe 1 LG-2MF+NE 
— = an =_-:-ke 
detA EG-— F? 2 EG — F2 


Proof The principal curvatures are the roots of the equation 


L-kE M-kF|__ 


M—kF N—kG|~” 
1.€., 
(EG — F*)k? — (LG —2MF + NE)k+ LN — M? =0. 
Therefore, the product of principal curvatures equals NM and their half-sum 
equals 5 : ee Oo 


The expression of Gaussian curvature in terms of the first quadratic form is 
much more cumbersome, but the calculation itself is fairly simple. In the expression 
K = LN—M* we must get rid of the coefficients of the second quadratic form. 
For this purpose, it is convenient to use the notation of Sect. 3.4: the surface under 
consideration is parameterized as r(u, v), n is the unit normal vector to the surface, 
and r, and ry are the partial derivatives of r with respect to u and v. For the 
coefficients of the second quadratic form we use the expressions L = (ryy,7), 
M = (ry, n), and N = (ryy,n). The vector n can be expressed in terms of 7, and 
ry as well: the squared length of the vector r,, xr, equals rope (Tus ry)? = EG—F?, 
whence n = —2X 


J EG—F2 
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Theorem 3.3 (Gauss) The Gaussian curvature K is expressed in terms of the 
coefficients of the first quadratic form as 


— 


—5Guu + Fuy — 5 Evy 7Eu Fu — 4 Ey 0 5Ey 7Gu 
K(EG— F’)? = Fy — Gu E F |-|5E, E F 
Gy FG 3Gu F G 
Proof Let us substitute the expressions L = (ras Stites, M = 
ry Xr ryuxr : 
Tuy, ~ === }, and N = |{ ry, —_~ ] for the coefficients of the second 
| ( ee | 


: : —M2 ; 
quadratic form into the formula K = ot. We obtain 


K(EG — F*)* = (uu, Tu X Tv) + Cove tu X Ty) — Cus Fu Ti) 


Tuu Tov Tuv Tuv 
=det| r, | -det| r, | —det] r, |] - det] r, 
ly ly ly ly 


In the last expression the symbols r,,,, 7,, etc. in the matrices denote the coordinates 
of the corresponding vectors written in a row. Let r7,, rJ’, etc. denote the same 
vectors written in a column. Then 


K(EG — F*)? 
Tuu Tuv 
= det] r, } - det (a. re rr) —det| r, | - det (a an i) 
ly ly 
(Yuu, Tov) Cuus tu) Suns rv) (uv, uv) Tuvs tu) Cuvs tv) 
=| (Tw, Mu) E F — | uv, Tu) E PF 
(Tov, rv) F G (Tuv, Tv) F G 
(Tun, Tov) — Cuvs tuv) Cun, Tu) Cun, tv) 0 (Tuvs Tu) uv, tv) 
= (ruv, Tu) E F — |(Tuv, Tu) E F 
(Tuv, Tv) F G (uv, Tv) F G 


(We have used the fact that the elements (ryy,,ryy) and (ryy, yy) occur in the 
determinants of the matrices under consideration with the same coefficient EG — 
F?.) 

Now we express the inner products in the obtained matrices in terms of 
the coefficients of the first quadratic form. First, we differentiate the equations 
(‘u,1u) = E and (ry, ry) = G with respect to u and v. As a result, we obtain 


1 1 1 1 
(Tuu, Tu) = 5 Eu (uv. Tu) = ak (Tuy, tv) = zou (uv. tv) = zor 
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Then, we differentiate the equation (r,, ry) = F with respect to u and v and use the 
equations already obtained: 


1 1 
Cus Tr) = Fu — 5 bv, (ov. Tu) = Fy — 5 Gu. 


It remains to calculate (74, rvv)—(“uv; Tuv). For this purpose, we differentiate the 
equation (Fyy,’y) = 5Gu with respect to u and the equation (yy, 7y) = Fy — sEy 
with respect to v: 


1 
(Tuvus tv) + uv, Tou) = zu 
1 
(uns tv) + (uns tov) = Fuv — 5 Bow. 
It follows that 
1 1 
(Tuus Tuv) — (uv, Tou) = 5 Guu + Fav - 3 Ev. 


Substituting the found values into the matrices, we obtain the required expression. 
oO 


HISTORICAL COMMENT The theorem that Gaussian curvature can be expressed in 
terms of the metric and, therefore, does not change under bendings (isometries) 
of surfaces was proved by Gauss in 1827. Gauss named it Theorema Egregium 
(remarkable theorem). 


3.6 Gaussian Curvature and Differential Forms 


Here we need only the simplest differential 1- and 2-forms on one- and two- 
dimensional manifolds, and so we give only the most necessary definitions. A 
differential 1-form @ is the object dual to a vector field, i.e., a linear function on the 
vectors of the tangent space. For example, to a function f its differential is assigned, 
which is the 1-form df = oF dx! taking the value of V’ at each vector field V. A 
differential 2-form is a skew-symmetric bilinear function on pairs of vectors. For 
example, the form dx! A dx/ takes the value V' W/ — V/ Wi’ at a pair of vectors V, 
W. In 2-dimensional space the 2-form dx! A dx? is basis, i.e., any 2-form is this 
form multiplied by some function. The differential of the 1-form w = fydx® is the 
2-form 


Ofo 


dw = dfy A dx® = “dx! A dx®. 
ox! 
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The differential of the k-form w = Fe ad" A+++ A dx'k is the (k + 1)-form 
dw = dfi,..i, A dx! A-+- A dx'k, 


Using this expression, it is easy to check that ddw = 0 and d(@ A @2) = da, A 
@2 + (—1)¥ a A da@ for a k-form @}. 

Now we are ready to give another definition of Gaussian curvature, which uses 
differential forms. Let S be a smooth oriented surface in R?* , and let €1, €2 be 
a moving frame in some domain of S, that is, a frame whose vectors ¢; and €2 
smoothly depend on the point and form an orthonormal frame of the tangent plane 
at each point. The basis ¢1, €2 is associated with the basis o!, ow” of the dual space 
(the space of 1-forms) determined by the condition wl (xley +x769) = x!. The form 
w! Aw nowhere vanishes; therefore, do! = aw! Aw andda* = Bo! Aw’, where 
a and # are smooth functions. Consider the 1-form Ort = aw! + Ba. It satisfies 
the equations 


do! =a, Aw and da* =-aji Aa. (3.2) 
Note that these two equations uniquely determine the 1-form wr. 
The form day is a 2-form. Therefore, dar = —Kw! A w’, where K is some 


number; this number is called the Gaussian curvature of the surface at the given 
point. This new definition of Gaussian curvature is equivalent to that given in 
Sect. 3.4; we will prove this later on (see p. 79). 


Theorem 3.4 The Gaussian curvature K does not depend on the choice of a moving 
frame. 


Proof The statement to be proved is local, so that we can assume that we work in 
a small neighborhood homeomorphic to the disk. In this case, any moving frame is 
obtained from the given one by rotating it through an angle smoothly depending on 
the point. 

We set @! = cos gw! + singw? and &* = — sing! + cos gw”. Applying (3.2), 
we obtain 


dé! = cos oda! + sin oda +dg A (—sin go! + cos pw’) 
= cos pw; hit = sin a; Ao! +dg rae = (wt + dy) A’, 


da” = —(w7 + dg) A G'. 


Thus, a; — or + dq. Therefore, da; — day. It is also clear that 


o! A = (a! cosy + a sing) A (—a! sing + w cosy) = w! Aa. 
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Remark If ¢, = x11&) + X12€2 and €2 = x21&| + x22€2, then xj; = o'(e1) = cos@g, 
xy2= @*(e1) = —sing, x21 = | (e2) = sing, and x22 = 0° (€2) = cosq. Thus, 
€] = COS PE — SIN YE? and €2 = singe; + COS Gép, 1.e., gy is the angle of rotation 
from the vector €; to &1. 


The form Ore which we used to define Gaussian curvature, is related to the 
geodesic curvature kg as follows: if e; and €9 are the first two vectors in the Darboux 
frame of a curve y(s), then the restriction of the form Ort to this curve is the form 
keds. To prove this, consider a moving frame ¢€1(x), €2(x), €3(x) in IR3, where 
x € R?. We have dx = w'¢;, where w! is the form to dual ¢;, and de; = w} &). 
So far, w! and wo; are the mere notations of some 1-forms on R?. But we will shortly 
show that the forms w!, w*, and O%3 which we used to define Gaussian curvature, 
are the restrictions to the surface of the forms w!, w?, and Ort on R?. 

The equations 0 = ddx = d(a'é;) = (da')é; — w' A de; show that dw! = 
a! Aw}. Similarly, the equation dde; = 0 implies 0 = d(wje;) = (do Je; — 
wo} A oh ek 1 (dw! Je _ ok A we j, 1.€., do; a ok A Oj. Moreover, if the frame is 
orthonormal, then (¢;, ¢;) = 6;;, and hence (¢;, de;)+(€;, dé;) = 0,ie., wo} +a = 
0. In particular, a} =0. 

The restrictions to the surface of forms on R? have zero component along the 
normal ¢3(x), and hence w* = 0. Therefore, dw! = w! A o} +07 A o} +a0°A Oo} = 
w A oh = —w" A wt and dw? = w! A wr. Thus, the forms w!, w?, and or used to 
define Gaussian curvature are indeed the restrictions to the surface of the new forms 
w!, w*, and wr on R?. 

In the new notation, the equations of motion of the Darboux frame for the 
restrictions of the forms to the curve y(s) are 


dé, = wre? + OFe3, 


dé. = or + W3E3, 
dé3 = —aFe\ - W3ED; 


2 Det 3 
therefore, wf = keds, w] = knds, and w; = xgds. 


Problem 3.15 Prove that dw; = @} A w;, da; = w} A 3, and da; = w}) Aaj. 
Problem 3.16 Prove that the restrictions of the forms to the surface satisfy the 
equation w! A OH +A w3 = 0. 


Let us now prove that the new definition of Gaussian curvature is equivalent to 
that given in Sect. 3.4. It suffices to check that the curvatures coincide at each point. 
Let r = r(u, v) be a parameterized surface. Suppose that the matrix of the first 
quadratic form at a given point is the identity, ie., (ry, 7,) = (7, ry) = 1 and 
(Ty, ry) = 0. We set ry = €1, ry = €2, andn = €3, where n is the normal vector. 
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Then dr = o!e r+ we, because the normal component of dr is zero. Therefore, 
the value of the second quadratic form at dr equals 


IW(dr) = (dr, —dn) = (oe + 072, whe ;). 


Suppose that o; = Aw! + Bo? and w3 = Co! + Do”. It follows from the 
equation wo! A oF +0 A w3 = 0 (see Problem 3.16) that B = C. Hence, in the 
notation of Sect. 3.4, we have dr = w!e; +*e2 = r,du+rydv anddn = whe; = 
—w7e| — @3£2 = —r, (Adu + Bdv) — ry(Bdu + Ddv). Since the matrix of the first 
quadratic form is the identity at the given point, if follows that, at this point, 


IW(dr) = (dr, —dn) = Adu? + 2Bdu dv + Dav’, 
1e., L = A, M = B,and N = D. Thus, 
wo; A @3 = (AD — BC)o! Aw = (LN — M’)o! Aa = deta! A a”. 
On the other hand, according to Problem 3.15, we have oF A w3 — —o} A ws — 


—dot = Ko! A @”. Therefore, det(I) = K. This is precisely what we need, 
because the matrix of the first quadratic form is the identity at the given point. 


Remark As already mentioned, Eq. (3.2) completely determines the form Oi. This 
means that the forms da; and w! A w are completely determined by the metric. 
Thus, it follows directly from the definition of Gaussian curvature in the language of 
differential forms that Gaussian curvature is completely determined by the metric. 


Problem 3.17 Prove that —w! ~ o3 +A oF = 2Ho! Ao”, where H is the mean 
curvature. 


3.7 The Gauss—Bonnet Theorem 


The Gauss—Bonnet formula relates the integral of Gaussian curvature over a polygon 
on an oriented surface to the integral of geodesic curvature over the boundary of 
this polygon and the sum of exterior angles of the polygon. From this formula an 
expression for the total Gaussian curvature of a closed oriented surface in terms 
of the Euler characteristics can be derived, which is known as the Gauss—Bonnet 
theorem. The Gauss—Bonnet theorem can also be proved without using the Gauss— 
Bonnet formula. We give both proofs. 

The Gaussian curvature K does not depend on the surface orientation; moreover, 
it can be defined for nonorientable surface. But the Gauss—Bonnet formula and the 
Gauss—Bonnet theorem involve the integral I Gk do, where do is the area form, and 
the area form exists only on orientable surfaces. In one of the proofs of the Gauss— 
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Fig. 3.1 The Gauss—Bonnet 
formula 


Bonnet theorem we cut the surface into parts and apply the Gauss—Bonnet formula 
to each part. This can be done only in the case where the surface is orientable. 

The Gauss—Bonnet formula applies to a simply connected domain G bounded by 
a piecewise smooth curve consisting of smooth curves yj, y2, ..., Y, on a surface 
M?; we denote the angles between the curve-pieces at the corners by a1, @2,..., Qn 
(see Fig. 3.1). The angles in the corners can take values from 0 to 27 (in the figure 
the angles a; and a3 are larger than 7), and the exterior angles 7 — a, can take 
values from —z to z. Recall that a simply connected domain is always orientable. 

To prove the Gauss—Bonnet theorem, we need Umlaufsatz (Theorem 1.2) in 
a form more general than that proved above: Given a vector field ¢; without 
singularities on a simply connected domain bounded by a smooth positively oriented 
curve on a surface, the tangent vector €; makes one turn in the positive direction 
with respect to ¢; while traversing the curve. Umlaufsatz implies this assertion only 
for a plane domain and a constant vector field ¢;. There is no substantial difference 
between a constant vector field and a vector field ¢; without singular points: the 
index of a vector field without singular points is zero, and hence ¢; makes zero turns 
as the curve is traversed. Consider a plane chart parameterizing the given simply 
connected domain. Let (, )g be the inner product of vectors in the plane at some 
point, and let (, ); be the inner product of the corresponding tangent vectors to 
the surface at the corresponding point. According to Umlaufsatz, if angles between 
vectors are measured by using the inner product (, )o, then the velocity vector 
of the curve makes the required number of turns. Therefore, it suffices to check 
that the number of turns is the same when angles are measured by using the inner 
product (, );. To this end, consider the family of inner products t(, ); +U—1)(, )o, 
0 <t < 1. Since the number of turns must vary continuously with ¢, it is constant. 


Theorem 3.5 (Gauss—Bonnet Formula) Let G be a simply connected domain 
bounded by a piecewise smooth curve consisting of smooth curves y\, 2, --+» Yn 
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on a surface M2, and let a, a2, ..., Q, be the angles between these curves at the 
Joint points. Then 


ay | keds +r a) = 20 f Kdo. 
Vk k G 


k 


In particular, if the bounding curve y is smooth, then 


| %as = 20 - [ Kdo. 
y G 


Proof First, consider the case where the curve y is smooth. By assumption the 
domain G is simply connected, and hence there exists a vector field ¢; on G without 
singular points; we can assume that each vector ¢; has unit length. Let us complete 
the vector ¢; to an orthonormal positively oriented basis ¢1, €2 in the tangent plane 
and construct a form or for which da; = —Kw! \ w* = —Kdo. By Stokes’ 
formula — [, Kdo = i, ore 

In a neighborhood of the curve y, in addition to the vector field ¢;, we can 
consider a vector field €; whose restriction to y is the tangent vector field on y. 
Using the vector field 1, we construct a form Or. We have or = or + dg, where 
gy the angle of rotation from the vector &; to the vector ¢; (see the remark after 
Theorem 3.4). Therefore, f, wr = jy or 7 I dg = { keds — 2m, because, 
according to Umlaufsatz, as the entire curve is traversed, the angle g changes by 
27, and the angle of rotation is measured in the opposite direction: from the tangent 
vector €, to the vector ¢1. 

In the general case of a piecewise smooth curve, the exterior angles 2 — a at the 
joint points are included in the integral /- ' dg. oO 


HISTORICAL COMMENT In 1827 Gauss proved that the total curvature of a 
geodesic triangle (i.e., the integral of Gaussian curvature over a geodesic triangle) 
equals the sum of its angles measured in radians minus 7. The total curvature of 
a geodesic polygon equals 27 minus the sum of exterior angles. In 1848 Bonnet 
generalized this statement from geodesic polygons to general closed curves; in the 
case of curves, the sum of exterior angles of a geodesic polygon is replaced by the 
integral of geodesic curvature along the curve. 


Problem 3.18 Prove that two geodesic lines on a surface which intersect in two 
points cannot bound a simply connected domain (digon) with nonpositive Gaussian 
curvature at all points. 


Theorem 3.6 (Gauss-Bonnet) Let M? be a closed oriented surface in R>. Then 


I. Kdo =2n- x(M’), 


where x (M7) is the Euler characteristic of the surface M?. 
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First Proof We cut the surface M? into curvilinear polygons and apply the Gauss— 
Bonnet formula. Suppose that the number of polygons is f, the number of edges is e, 
and the number of vertices is v. Let us show that the summation of f Gauss—Bonnet 
formulas yields 


2rme—2nv=2nf — Kdo. 
M2 


For the right-hand side, this is obvious; consider the left-hand one. First, the integrals 
iy kgds occur twice with opposite signs for each edge, so that they cancel one 
another out. Secondly, summing the terms }°, (a — ax), we obtain 27e — 27v. 
Indeed, the number of terms equal to mz coincides with the number of all vertices 
in all polygons (each vertex is counted as many times as the number of polygons 
containing it); therefore, it equals the number of all edges in all polygons, which, in 
turn, equals 2e, because each edge belongs to precisely two polygons. Finally, the 
terms a at each of the v vertices amount to the full angle 27. 
As aresult, we obtain 


/ Kdo = 2n(v—e+ f) =2n- x(M?). 
M2 


oO 


Second Proof (Not Using the Gauss—Bonnet Formula) Let us construct a vector 
field V(x) on M? with finitely many singular points. The sum of their indices equals 
x(M7). Let |_| De; be the union of e-neighborhoods of the singular points; we 
assume that ¢ is sufficiently small and these e-neighborhoods are disjoint. Then 


/ Kdo = lim Kdo. 
M? &>0JM\| | Dei 


The vector field under consideration has no singular points on M? \ |_| Dei; 
therefore, using this field, we can construct a moving frame ¢€1, €2 and consider 
the dual frame w!, w?. From this dual frame we construct a form wr for which 
do; = —Ka! Aw. Applying Stokes’ formula, we obtain 


/ Kdo = [ Kol nw? = — f aot = — [ OW; . 
M?\| | Dei M?\| | Dei M\[| Dei LI Sei 


Together with the initial vector field V(x), consider a “constant” vector field 


V(x) on D,,;, that is, a vector field constant in some coordinate system on D,,;. 
Using V (x), we construct the form or. 
In Stokes’ formula the orientation of the boundary of a two-dimensional surface 


is defined as shown in Fig.3.2. Therefore, the orientations of the circles S,,; are 


negative (i.e., clockwise). We have Or = or + dg, where ¢ is the angle of rotation 
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Fig. 3.2. Orientation of the 
boundary 2 


< 


from the vector V(x) to the constant vector V(x). Hence the vector V(x) makes an 
angle of —@ with the constant direction. Therefore, while traversing the whole circle 
in the positive direction, the angle g changes by —2z ind;, where ind; is the index 
of the given singular point, and while traversing the circle in the negative direction, 
gy changes by 27 ind;. Thus, 


[ a=] wy + 2m ind; . 
EL Se,i 


Moreover, 


As aresult, we obtain 


Kdo =2 ind; = 27 - x(M?). 
I. oO [ind mz - x(M~) 


l 


3.8 Christoffel Symbols 


Given a surface r(u!, w2), consider the unit normal vector n and the tangent vectors 


r= = and r2 = a at each point. Differentiating the equation (n,n) = 1, we 


obtain (n;,n) = 0, where n; = oni = |, 2. Therefore, the vectors n; are expressed 
in terms of r; and r2 as nj = of r;. Let us calculate the coefficients rs . On the one 
hand, (n;, r,) = —bjz, where (b;x) is the matrix of the second quadratic form. On 


the other hand, (n;, r,) = a (j,.7h) = ci Sik, where (g jx) is the matrix of the first 
quadratic form. Let b! = bixg. Then e = a ging! = —dig" = —bi. Thus, 


nj = —b! rj, where b! —| big (Weingarten equations). 
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In particular, if the vectors r; and r2 correspond to principal curvatures k; and ko, 
then nj = — fit; = —kjrj;, 1e., 


or on | 
‘aa + a =0 (Rodrigues’ formula) 


(the summation with respect to the repeated index i is not performed). 


HISTORICAL COMMENT The Weingarten equations were obtained by Julius Wein- 
garten (1836-1910) in 1861. Rodrigues’ formula was proved by Benjamin Olinde 
Rodrigues (1795-1851) in 1815. 


Problem 3.19 Prove that if the principal curvatures of a closed surface are equal 
and different from zero at each point, then the surface is the sphere. 


HISTORICAL COMMENT The statement in Problem 3.19 was proved by Meusnier 
in 1785. The points at which the principal curvatures are equal and different from 
zero are called umbilical. 


In each tangent plane to a surface we can consider the linear operator L with 
matrix (b! ). This operator is called the Weingarten operator. It is related to the 
second quadratic form B as B(u, v) = (Lu, v). Geometrically, the operator L is 
defined as Lu = —0,n (the derivative of n in the direction of uw). 


Problem 3.20 Prove that L? = 2HL — KI, where H is the mean curvature, K is 
the Gaussian curvature, and / is the identity operator. 


The vector rj; is expanded in 


_ F " 2 
Now consider the second derivatives rj; = , 


the basis n, r}, r2 as 
tk 3 
rig = Vyjrk + ajjn. 


Let us calculate the coefficients 9 and a;;. First, note that (r;, 1) = O, and therefore 
(rij,n) = ajj,1.¢., the ajj = Dj; ae the entries of the matrix of the second quadratic 
form. The formula 


lij =T* ilk + bin (3.3) 


will often be used in what follows. 

Next, we have (rjj,17) = (Tire, 1) = Ti Skis whence ry, = ri gn” = 
(rij, rig"? 

The numbers ri. are called the Christoffel symbols, and the formula rjj = 
r t rg +bjjn is known as Gauss’ formula. It follows from the symmetry of the second 
derivative (rj; = rj;) that the Christoffel symbols are symmetric with respect to the 
subscripts: ri, — Mii 
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HISTORICAL COMMENT In 1869 Elwin Bruno Christoffel (1829-1900) considered 
conditions for the coincidence of Riemann geometries with first quadratic forms 
gijdxjdxj and h;;dx;dx;. This generalized the superposition problem for surfaces 
to the many-dimensional case. The condition that the geometries coincide was 
expressed in terms of the Christoffel symbols. In the two-dimensional case, the 
Christoffel symbols for surfaces were known to Gauss; they are mentioned in his 
1827 paper. 


Theorem 3.7 (Gauss) The Christoffel symbols are expressed in terms of the first 
quadratic form (g;j) as 


pk a pe (98 _ 88ii , 28u) 
2 dus aul ut J 


Proof Let Vij = Vi 8k — (rij, r1). Clearly, rij = jis therefore, Vij — DV jit. 
Differentiating the equation gj; = (rj, rj), we obtain 


ij 
Bah = ib) + Cary) = Taj + ju. 
Similarly, 
gil d8ij 
a Diy + yi, dae Pyaij + Uji. 
Thus, 
1 (as _ ay , ay) _ 
2\aul oul " aut) 
It is also clear that ri, = 8 gyiTi; = eT. 7 


Problem 3.21 Calculate the Christoffel symbols for polar coordinates (r, g) in the 
plane. 


Problem 3.22 Calculate the Christoffel symbols for a surface of revolution 


(f(u) cos v, f(u) sin v, u). 
Problem 3.23 Let ri, be the Christoffel symbols in coordinates (x!, x7), and let 
E : q be the Christoffel symbols in coordinates ( y!, y?). Prove that 


nt (pe a” at, ayt ash 
y PL axt Oxi Axidxi } dys” 


Problem 3.24 Prove that, at any point of a two-dimensional surface, local coordi- 
nates can be chosen so that all Christoffel symbols at this point vanish. 


3.9 The Spherical Gauss Map 87 
3.9 The Spherical Gauss Map 


For the sphere in R? considered not only purely topologically but also geometrically, 
i.e., as a surface with first quadratic form, we use the notation S?. 

Given a smooth oriented surface M? in R?, the spherical Gauss map f : M? > 
S? takes each pointx « M ? to the head of the unit normal vector n(x) to the surface 
M? at x. 

dn on 


According to Rodrigues’ formula, we have [> = hig and = = =ky Xs. 


Therefore, an infinitesimal parallelogram of area ler x a du' du? on the surface 


M? is mapped to an infinitesimal parallelogram of area 


on on 
aul * Wie 


or 0 
= [kiko — x f 


du'du eee 
ae dul” du2 


on the sphere. Since the ratio of oriented areas equals K, we obtain the following 
interpretation of Gaussian curvature: 


_ the oriented area of f(A) on S? 
~ the oriented area of A on M2 ’ 


where A is an infinitesimal figure at a point x € M7. Thus, 
Kdoy2 = f*dog, (3.4) 


where doy2 and dog: are the area forms on M? and on S?. 

Relation (3.4) can also be obtained in the language of differential forms. Recall 
thatn = 3 and Ka! Aw? = —dwt = oF A 3. Here w! A w” is the area form on 
the surface M7. The vectors ¢; and ¢ form a basis of both the tangent space to M* 
at the point x and the tangent space to the unit sphere S? at the point f (x). For the 
sphere, the equation de3 = OxE1 +0362 is similar to the equation dx = we; +are2 
for the surface M7. Therefore, o; A o3 is the image of an area element of the surface 
under the spherical Gauss map. 


For any smooth map g: M” — N” of manifolds, we have 


[ s'o= eee) o, (3.5) 


N” 


where deg g is the degree of g. Indeed, using a partition of unity, we can reduce 
the proof to the case where the form is different from zero only in an arbitrarily 
small neighborhood. In the case where g maps homeomorphically several open balls 
to one ball, the task reduces directly to the definition of the degree of a map. The 
critical values can be ignored, because they constitute a set of measure zero. 
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Theorem 3.8 For a closed oriented surface M? C R?, the degree of the spherical 
Gauss map equals Xx (M?). 


Proof Applying Eq. (3.5) to the spherical Gauss map f, we obtain 


i f*dog = (aces) [ dog. = 4m - (deg f). 
M2 S2 


According to (3.4), we have f*dog. = Kdoy2. Moreover, by virtue of the Gauss— 
Bonnet theorem, tue Kdoy2 = 21 - x (M2). Thus, 421 - (deg f) = 27 - x (M), ice., 
deg f = 3x(M?). o 


HISTORICAL COMMENT Gauss introduced the spherical Gauss map in 1827, in 
studying curved surfaces. He used it to define the Gaussian curvature of a surface. 
Before Gauss, the curvature of a surface was defined in precisely the same way by 
Rodrigues in 1815. 


Problem 3.25 Prove that the differential of the spherical Gauss map f: M* — S? 
is a self-adjoint operator and the quadratic form associated with this operator is the 
second quadratic form with the minus sign. 


3.10 The Geodesic Equation 


Recall that a curve y(s) = r(u'(s),u?(s)) with natural parameterization on a 
smooth surface r(u!, u7) is said to be geodesic if its geodesic curvature identically 
vanishes. 


Theorem 3.9 A curve y(s) ona surface r(u', u*) is geodesic if and only if 


auk rk dui dul Oprk= 1,2 
oe CU Or — >. 
ds2 Yds ds 


Proof First, suppose that the parameterization of the curve is natural. Consider the 
: dul a 
velocity vector of the curve ¢; = Sor;, where rj = cae Clearly, 


de, du’ dui rn dui 4 0¢r 
—_— = —: "ij + — i. Where rj; = —— . 
ds ds ds "~ ds2' I Quidul 


Therefore, applying the Gauss formula (Eq. (3.3) on p. 85), we obtain 


(3.6) 


dey _ (px dui dul | dut a5 du’ dul 
ds \Uds ds dst J * 8 Gs ds 
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Recall that sea = kgé2 + kné3, where the vector €2 lies in the tangent plane and 
£3 =n. The wectbis r) and rz are linearly independent; it follows that ky = 0 if and 
only if both coefficients multiplying the vectors r; and rz vanish. 

A geodesic with any parameterization has the form y(as + b), where y(s) is a 
geodesic with natural parameterization and a and b are constants, a ~ 0. The curve 
y (as + b) satisfies the above equation if and only if so does the curve y(s). oO 


In Problem 3.22 we calculated the Christoffel symbols for a surface of revolution 
(f(u) cos v, f(u) sinv, u). This allows us to write equations for a geodesic on a 
surface of revolution. One of these two equations has the form 


It follows from this equation that, for a geodesic on a surface of revolution, the 
quantity f odv is constant. Indeed, 


d [( »dv ,(@v  _f' du dv 
— | f—)= —_+2--.—.—]= 
ds G *) f (G+ f ds ds si 


A visual geometric interpretation of the constancy of f 2gu is given by Clairault’s 
theorem. 


Theorem 3.10 (Clairault) Fora geodesic y ona surface of revolution, the product 
f sina, where f is the distance to the axis of rotation and a is the angle between 
the velocity vector of the geodesic and the corresponding meridian, is constant. 


Proof The vectors e, = (f’ cosv, f’ sinv, 1) and e, = (—f sinv, f cos v, 0) are 
sent to the ae and he parallel, respectively. It follows that, on the one 


hand, 4 s = cosa7— 1a T+ sina 2 7 by se definition of the angle a, and on the other 
— du dv 

hand, a direct calculation eed that 44 x“ = Feu + Fey. Therefore, sina = Fz ae 

Thus, f sina = pe is constant. Oo 


HISTORICAL COMMENT Alexis Claude Clairault (1713-1765) proved the theorem 
about a geodesic on a surface of revolution in 1733. 


3.11 Parallel Transport Along a Curve 


Let y(t) = r(u'(t), u2(t)), t € [a, b], be a curve on a surface r(u!, u2). Consider a 

smooth vector field V(t) on the curve y, i.e., a smooth a V: [a,b] > R?. The 

vector field V is said to be parallel along y if the vector 4 is perpendicular to the 

surface at each point. This fenmuuon does not depend on the parameterization of the 
dV _ dV . ds dv 


curve y, because 7- = G_ - 7,, 1.e., the vectors F- and 4Y are proportional. 
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For an arbitrary (not necessarily tangent to the surface) vector field, the definition 
of parallel transport along a curve on a surface involves the arbitrariness in the 
choice of a normal direction. But of greatest interest are vector fields that are tangent 
to the surface at each point. In what follows, we mainly consider vector fields with 
this property. A vector field on a surface can be represented in form V! aaa +v? 2. 

As we will shortly see, a vector field parallel along a curve is uniquely determined 
by any vector in this field; i.e., given a tangent vector, we can define a uniquely 
determined parallel transport of this vector along a curve. The geometric meaning 
of such a transport is as follows. The vector is translated to an infinitely close point 
of the curve in R*. The translate is not necessarily tangent to the surface. To obtain 
a tangent vector, we take the orthogonal projection of the translate on the tangent 
plane. Such is the geometric interpretation of the perpendicularity of the vector 
dV = V(t + dt) — V(t) to the tangent plane. 


Example 3.5 If acurve y lies in a plane, then a vector field in the plane is parallel 
along y if and only if it is constant (i.e., a vector is translated along y in the ordinary 
sense). 


Proof We can assume that the plane in which the curve y lies is given by the 
equation x3 = 0. Let V(t) = (V1(0), V2(1), 0). Then 4% = (Se. 4.0). This 


vector is orthogonal to the plane x* = 0 if and only if dy" = ee = 0. This means 
that V(t) = const. oO 
Theorem 3.11 A vector field V(t) = V' Orn is parallel along a curve y(t) if and 
only if 
dV" rk yi! 80 for k=1,2 
ties —=—— or = 1,24. 
dt J dt 
Proof First, note that 
dV dv* or ; or dyi 
dt dt  duk duldus dt © 


Next, according to the Gauss formula (Eq. (3.3) on p. 85), we have 


ar k or 
duiaul J auk aoa: 
Thus, the orthogonal projection of the vector ay on the tangent plane equals 


k dyl : : : 
oe + ri, 4 a) a Since the vectors on and a are linearly independent, 


it follows that the projection is zero (i.e., the vector field V(t) is parallel along the 
curve y) if and only if the equations in the statement of the theorem hold. Oo 


Corollary 1 A curve y is geodesic if and only if it velocity vectors form a vector 
field parallel along y. 
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Corollary 2 A vector field parallel along a curve y(t) is uniquely determined by 
any of its vectors. To be more precise, given any vector tangent to a surface at some 
point of a curve y(t), there exists a unique vector field V(t) parallel along y(t) and 
containing this vector. 


Proof To construct a vector field V(t) parallel along y(t) and containing the given 
vector, we must find a solution of a system of differential equations with given initial 
condition. This can be done in a sufficiently small neighborhood. It remains to cover 
the interval by a finite number of such neighborhoods. oO 


Corollary 2 suggests the definition of the parallel transport of a vector along a 
given curve. This transformation is a map of the tangent plane at an endpoint of 
the curve to the tangent plane at the other endpoint. It is easy to show that parallel 
transport preserves inner product. Indeed, if V(t) and W(t) are vector fields parallel 
along a given curve, then 


d _ (ava) dW(t)\ | 
Pree ae ( 7 wo) + (vin, a) =0, 


t ’ 


because the vectors av® and ano are orthogonal to all tangent vectors. Since 


inner product is preserved by parallel transport, it follows that so do the lengths of 
vectors and the angles between vectors. 


HISTORICAL COMMENT The theory of the parallel transport of a vector along a 
curve on a Riemannian manifold was developed in 1917 by Tullio Levi-Civita 
(1873-1941). 


According to Theorem 3.7, the Christoffel symbols are expressed in terms of the 
metric g;;. Therefore, the parallel transport of a vector along a curve on a cone is 
the same thing as the parallel transport along the image of this curve on a net of 
the cone, and the parallel transport of a vector along a plane curve amounts to an 
ordinary parallel translation. 

It is seen directly from the definition of a vector field parallel along a curve y that 
if two surfaces are tangent along y, i.e., the tangent planes to these surfaces coincide 
at each point of y, then any vector field parallel along y with respect to one of the 
surfaces is also parallel along y with respect to the other surface. For example, if 
we consider a cone and a sphere inscribed in it, then a vector field parallel along the 
circle of tangency y with respect to the sphere is the same thing as a vector field 
parallel along y with respect to the cone. Using this remark, we can describe the 
parallel transport of a vector along a circle on the sphere. 


Problem 3.26 A plane divides a sphere of radius r into two parts of areas S and 
4mr* — S. Parallel transport along the circle in the intersection of the plane with the 
sphere takes a vector V; to a vector V2. Prove that the angle between the vectors V1 
and V2 equals $/r?. 


92 3 Surfaces in Space 


The notion of parallel transport suggests the following geometric interpretation 
of geodesic curvature k, of a curve y(s) ona surface S. 


Theorem 3.12 Consider a parallel vector field V(s) on a curve y(s) with natural 
parameterization. Let g be the angle of rotation from the tangent vector €, = oy of 


the curve to the vector V. Then kg = ~ 


Proof Since the length of a vector is preserved by parallel transport, we can assume 
that || V(s)|| = 1. Then V = cos ge, +$sin gé2, where €2 is the vector in the Darboux 
frame. Recall that (¢1, £5) = —kg and (€2, £}) = kg. By assumption the vector field 
V is parallel along y; hence the vector V’ is parallel to the normal vector ¢3 at each 
point, i.e., V’ is perpendicular to the vectors €; and €2. Thus, 


0 = (€1, V’) = (e1, cos ge} + sin ge, — g' sin ge) + y' cos ger) 
= (61, €5 — g’€1) sing = —(ke +g’) sing, 


because (€1, a) = 0 and (¢€1, €2) = 0. A similar calculation for (€2, V’) shows that 
(kg + y’)cosg = 0. One of the numbers cos¢ and sing is different from zero; 


therefore, kg = —g’. Oo 


Corollary Parallel transport along a closed curve bounding a simply connected 
domain G rotates each vector through an angle of Ig Kdo. 


Proof Replacing the geodesic curvature in the Gauss—Bonnet formula 


[ kas = 20 - [ Kdo 
y G 
g 


by — 4, we see that the angle of rotation of a vector equals /, gKdoupto2x. oO 


3.12 Covariant Differentiation 


On any manifold the differentiation of a function in the direction of a vector can 
be defined. But an attempt to differentiate a vector field in the direction of a vector 
faces an insurmountable difficulty: we must consider differences of vectors lying 
in tangent spaces at different points, which requires that these tangent spaces be 
identified somehow. In the general case, this cannot be done, but in the presence 
of a metric, such an identification is possible. In fact, we have already performed 
differentiation in the definition of the parallel transport of a vector along a curve, 
because a vector field parallel along a curve is a vector field whose derivative in the 
direction of the tangent vector to the curve is zero. Thus, in fact, we defined what 
a zero derivative is. Now we give a general definition of the derivative of a vector 
field in the direction of a vector. 
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We can do this in two ways. First, we can repeat the construction used in the 
definition of the parallel transport of a vector along a curve in a more general form. 
Secondly, we can identify tangent spaces at different points of a curve by applying 
transport along this curve and use this identification to define differentiation. Below 
we give both of these definitions and prove their equivalence. 

Let f be a function on R”, and let V and W be vector fields on R”. Then we can 
define a function dw f (the derivative of f in the direction of W) by setting its value 
at a point P to equal the limit 


P W)-— f(P ; 0 
ii I PEW I). wip) (py, 

e—>0 E ox! 

We can also define a vector field dw V (the derivative of V in the direction of W). 
At a point P this derivative equals the limit 


fin ENO = VEY pie) ep), 
e>0 E ox! 

If V and W are vector fields on a surface § C R?, then the vector field dw V is 
not necessarily tangent to S. To obtain a vector field on S, we take the orthogonal 
projection of dw V on the tangent space at each point P € S. The vector field thus 
obtained is called the covariant derivative of V in the direction of W and denoted 
by Vw V. 

To calculate Vw V for a parameterized surface r(u!, u2), we need Eq. (3.3): 


ar 
rij 


k 
ag 


If W = Wir; and V = V'7;, then 


a ane avi Ori 
= J a. J , Jy! ce, 
= Vw ew 


evi a 
aN ee ae (Fyre + bin). 


To obtain the covariant derivative, it suffices to discard the component parallel to 
the normal vector n. Thus, 


avi a 
VwV = W) ri +wivi Dire. (3.7) 


In particular, 


Via T ee (3.8) 
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Now let us define covariant derivative by using the parallel transport of a vector 
along a curve and check that we come to the same definition. Let y be a curve on a 
surface S. We set a = W. Given a vector field V, let Vo and V, be its vectors at 
points y (0) and y(e), and let Vj be Vo transported along y to the point y (¢). Then, 
at the point y (0), we define 


Vv, -— Vi 
VwV = lim ——28, 


e>0 E 


We have 
OV OV! Ori 
Ve = Vo + eWi — + o(e) = Vo + eW/ —r; + eWiVI— 4 fe). 
Ou; Ou; ou! 
Moreover, it follows from the transport equation 


; re+ rv a =0 


(see Theorem 3.11) that 


dV av" ark iV! ari 
— = — 74+V =-Ty yi —— wivi'i—. 
dt dt. dt Gee aud 
Therefore, 
Jyipk f j Oi 
Vo = Vo — EW! V'Ts. gaa dere - + 0(&). 
Thus, 


i 


av i 
VwV = Wir, + WIV'TE ry, 
oud Wy 


which coincides with (3.7). 


HISTORICAL COMMENT Covariant differentiation was developed during the 
decade 1884-1894 by Gregorio Ricci-Curbastro (1853-1925), who named it 
“absolute differential calculus.” Ricci-Curbastro aimed at carrying over the usual 
procedures of calculus to Riemannian manifolds in such a way that they be invariant 
with respect to changes of coordinates. The most detailed exposition of this theory 
is contained in a 1900 joint paper of Ricci and his student Levi-Civita. In 1917 Levi- 
Civita introduced the notion of parallel transport along a curve on a Riemannian 
manifold, which had rendered Ricci’s formal constructions geometrically evident. 
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A direct calculation based on the expression for covariant derivative shows 
that the multiplication of one of the vector fields by a function f results in the 
transformations 


VewV =fVwV, VwfVY) = Ow fV + fVwV. (3.9) 


The symmetry of the Christoffel symbols with respect to the subscripts i, — 


ms ,) implies the symmetry of covariant differentiation: 
VwV —VvW = owV — oy W. 


The vector field dw V — dy W is denoted by [W, V] and called the commutator 
of the vector fields W and V. It is easy to check that 


Iw.vif = wv f) — dv Ow f). 


Indeed, dy f = vi or. ; therefore, 


i 2 
aw (av f) = aw (v4) lee aap 


ax! axd ax! Ox! Oxs” 
Hence 
avi __aw'\ af 
dw (0 — dy(0 = (| wi—-— vi 
w(ov f) — ov Ow f) ( as am) 


This is the derivative of the function f in the direction of the difference of the vector 
fields dw V = W/ 2% and dy W = V/Se 


axs° ; ; 
Thus, the ith coordinate of the vector [W, V] equals W/ ov —vV/ aw It is easy 
to derive from this expression that, for any function f, we have 


[fW.V]= flW,V]—Ovfow, LW, fV]= flW,.V]+ Owf)V. (3.10) 


Problem 3.27 Consider the vector fields rj = a andr2 = =, on a parameterized 


surface r(u! , u~). Prove that [r1, r2] = 0, i.e., the coordinate vector fields commute. 


In applications it is often required to find the covariant derivative of a vector 
field V given on a curve y(t) in the direction of the velocity vector of this curve. 
We denote this covariant derivative by V,,V. Let us discuss some properties of the 
covariant derivative V,/V. 

First, note that the covariant derivative V,,V along the curve y identically 
vanishes if and only if the vector field V is parallel along y. This assertion follows 
from the definition of covariant derivative, but it can also be proved by comparing 
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the equation of parallel transport along a curve (see Theorem 3.11) and the equation 
V,/V = 0. To this end, we write the parallel transport equation in the form 


dvi byidy! 
i ar ee 
dvi _ dy! avi 
and note that G- = 4; - Ou 


Theorem 3.13 For vector fields V and W onacurve y(t), 
d 
ae W) = (V,/V, W) + (V, V,/W). 
Proof We choose an orthonormal basis e; in the tangent space at the point y (0) and 


consider its parallel transport along the curve. At each point of the curve we expand 
the vectors V and W in the corresponding basis: V = V'e; and W = W/ e;. Clearly, 


i 


dv' dv 
Vy (V'e;) — ae + V'(Vyrei) = ap 


because V,,e; = 0 (the vector field e; is parallel along the curve). Therefore, 


dv' ; dW! 
(Vy V, W) (V, VW) = (ee W/e;) oF (V ei, dt ej) 
dv' dwi d 
= —W' 4+v'— =—(V,W 
dt * dt rr ) 


Covariant differentiation can also be extended to covector fields. A covector is 
an element of the dual space (a linear function on vectors), and a covector field is 
a differential 1-form. With each basis r; in a vector space a dual basis r/ in the 
covector space is associated; it is defined by the relation r/(r;) = 5! . The value of 
acovector T = TjrJ at a vector V = Vr; equals TV = 7; Vi. 

The covariant derivative VwT of a covector field T is defined so as to satisfy the 
following two conditions: 


(1) the Leibniz formula Vw (TV) = (VwT)V + T(VwV) holds, in particular, 
) ae 
V,,(TV) = (Vr, T)V + T(r, V) = (Vr TV +7) | > + TV" ds 
ous J 


(2) Vw(TV) is the usual derivative of the function TV in the direction of W, in 
particular, 


aT. 2 av! 
V-,7TV) = aaa + fa 
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Comparing the two expressions for V,;(7'V), we obtain 


aT; 
(7,7 + nerf - ai at) vise 


therefore, 


(Vr; T); — Sink = TV j;- 


3.13 The Gauss and Codazzi-Mainardi Equations 


‘ « “ 3 
Let us continue the calculation begun in Sect. 3.8, namely, calculate rj jx = — 


and coer the resulting expressions for rjjx and rjx;. Using the notation b = 


biz g* introduced in Sect. 3.8 and the relations 2 on _ _p/ “rj and rij = EF iri t bin 


aut I 
obtained in the same section, we arrive at 


a I 
rijk = a (Tiyn + bijn) 


or’. an 


— ti r dbij Foes 
= ae rE LG rik + sue ii Pye 


U 
= a ar + THT tm + Tb py 
= auk I [kim ij Plk auk ijORTI 


I dij avi Pp I 
={T; Pea cnt aae 2 + PP Ave bj jb), | 11. 


A similar calculation yields 


Obi ar! 
Tikj = (r Dl + at) n+ (= + ri _ bixb' r] 


But rjjk = rixj. Moreover, the vectors n and r are linearly independent. Therefore, 


OF Oe pce al 
Mal auk FVD py — PIV pe = bind 5 — Bij (3.11) 
and 
Ob;; Ob; 
ij ih = bi _ Ti bu. (3.12) 
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There are only two independent equations in (3.12): 


Ob; db;2 
nies +T},b12 +0 ?,b22 = =f +0 },bi +0 bo1 (3.13) 


fori = l andi =2. 


HISTORICAL COMMENT Equations (3.11) were obtained by Gauss in 1827. They 
are known as the Gauss equations. Gauss used them to express Gaussian curvature 
in terms of metric. The history of Eq. (3.13) is more intricate. They were first 
obtained by Karl Mikhailovich Peterson (1828-1881) in 1853, but his dissertation 
was published only a hundred years later. In this dissertation, he proved that the 
first and the second quadratic form determine a surface up to motion. Peterson 
supplemented the Gauss equations by two more independent equations and proved 
that if the coefficients of two quadratic forms (the first of them must be positive 
definite) satisfy these equations, then there exists a surface for which these forms are 
the first and second quadratic forms, and they determine this surface up to a motion 
of the space. (The first proof of this theorem was published by Bonnet in 1867; it 
is usually referred to as Bonnet’s theorem.) Later the same equations were obtained 
by Gaspare Mainardi (1800-1879) (in 1856) and Delfino Codazzi (1824-1873) in 
1859. For this reason, in Russia Eq. (3.12) are known as the Peterson—Codazzi 
equations and abroad, as the Codazzi—Mainardi equations. 


The Riemann curvature tensor of a two-dimensional surface is defined as 
1 I 

se 

tk ayi auk 


Pri Py 
aE ag hap pe 
In terms of this tensor, the Gauss equations can be written in the form 
l 1 l 
Rig = Dix; — bij dy. (3.14) 


Gauss used these equations to prove that Gaussian curvature is an intrinsic charac- 
teristic of a surface, 1.e., depends only on the metric. Gauss gave the name Theorema 
Egregium (‘remarkable theorem’) to this statement. 


Theorem 3.14 (Gauss) The Gaussian curvature K can be expressed in terms of 
the metric gj; alone. 


Proof Equation (3.14) implies 
Ri ¢Bim = bixb! bib} 
ijfk8lm = Vik | $lm — VijOKSlm 
= bixb jm — bij bem. 
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For i = k = | and j = m = 2, we obtain 
Ri, 812 = bi1b22 — bf, = det(bi;) 
= det ((b5)(gix)) = K det(gix). 
Thus, 


1 


= —— Rip) 812. 
det(gix) 1218 


It remains to recall that, according to another theorem of Gauss (see Theorem 3.7 
on p. 86), the Christoffel symbols can be expressed in terms of the metric gj,.  O 


It is easy to check that if the matrix (g;;) is the identity, then 


1 2 1 2 
Ry12 = Ry) = —Ry94 = —Rij2 =K 


and all the other components of the Riemann tensor are zero. 


3.14 Riemann Curvature Tensor 


The Riemann curvature tensor 


j 1 
a OV ix = ari; 


ik Oud auk 


Pri Pri 
TE! po Ve 
defines a trilinear function R(X, Y)Z: 

RO Pr y= 2 Rory: 


Theorem 3.15 The Riemann curvature tensor is expressed in terms of covariant 
differentiation as 


R(X, Y)Z = VxVyZ — Vy VxZ — Vx, ¥1Z. 


Proof First, we check the required equality for the basis vectors. For them, the 
commutator is zero (see Problem 3.27), so that the last term vanishes and what 
remains is 


Vij (Viti) — Vin (Vr Ti) = Vr; ie") ~ Vu ii? p) 


i r rer! 3 r 
Rp EE Ne pie ha 


l 
a ee oe 
~ ( dus 7 auk = Vig pj - A: TI. 


Ppl 
ry — Dy PD perl 
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Let us check the trilinearity of Vy Vy Z — Vy VxZ — Vix,y|Z. Applying (3.9) 
to the first two terms and the first equation in (3.10) to the last term, we see that 
when X is replaced by fX, where f is a function, the terms Vx Vy Z, —Vy Vx Z, 
and —Vrx,y)Z become, respectively, 


VexVYZ = fVxVyZ, 
—VyVexZ = —(0y f)VxZ — fVyVxZ, 
—Virx,y1Z = —Veix,yie@ypxZ = —fVixy1Z + Oy fyVxZ. 


Thus, their sum equals the initial expression multiplied by f. For Y, the calculation 
is similar. For Z, we have 


Vx Vy(fZ) = Vx((y f)Z + fVyZ) 
= (dx dy f)Z + (Oy f)VxZ + (Ox f)VyZ + fVxVyZ, 
—V¥Vx(fZ) =—(ovdx f)2— (xf) Vr2 — (ay f)VxZ — FVrVxZ, 
—VixyVfZ) = -Ox AZ — fVixyZ. 


Summing these expressions and taking into account the identity dydy — dydy = 
Ocx,y], we obtain the required expression. oO 


3.15 Exponential Map 


It follows from the geodesic equation that, for each point p of a surface S, given 
any tangent vector V at this point and a sufficiently small rf, there exists a unique 
geodesic yy (t) for which dy (0) = V and yy (0) = p. Suppose that yy (r) is defined 
for t = 1. Then we set exp,(V) = ypy(1). 


Theorem 3.16 The differential of the map exp, at the origin is the identity 
operator. 


Proof In the tangent space at the point p consider the curve a(t) = tV, where V is 
a fixed vector. For t = 0, the velocity vector of this curve equals V. The map exp, 
takes this curve to the curve yy (1) = yy (ft). For t = 0, the velocity vector of yy (t) 
on the surface S equals V. oO 


Corollary The restriction of the map exp p 10 a sufficiently small neighborhood of 
zero in the tangent space is a diffeomorphism. 


Proof The differential of the map exp p at the origin is a nonsingular operator; 
therefore, by the inverse function theorem, the restriction of the map exp, to a 
sufficiently small neighborhood of zero in the tangent space is a diffeomorphism. 

oO 
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For a fixed vector V, the curve c(A) = exp p (AV) is geodesic. Indeed, since 
yrv(t) = yv (At), it follows that eXPp QV) = nv) = wi). 

The length of the velocity vector of the geodesic yy (t) at each point equals || V ||; 
therefore, the length of a geodesic arc from the point p to the point exp,(V) = 
yv (1) equals ||V||. For this reason, the curve {exp,(V)|||V|| = r} is called the 
geodesic circle of radius r centered at p on the surface S. 

For a surface S in R*, the map exp p makes it possible to introduce so-called 
normal coordinates in a small neighborhood of the point p € S. Normal coordinates 
correspond to the image of a Cartesian coordinate system in the tangent space at 
the point p under the map exp,. Let e1, e2 be an orthonormal basis, and let q = 
exp, (x!e, + x2e2). Then (x!, x?) are the normal coordinates of the point q. 

At the origin the differential of exp, is the identity map; therefore, in normal 
coordinates at the point p the first quadratic form is (dx!)* + (dx?)’, ie., E( p= 
G(p) = Land F(p) = 0. 

In normal coordinates the geodesic lines through the point p are determined by 
the equations x! = tc! and x? = tc?. 


Problem 3.28 Prove that in normal coordinates all Christoffel symbols at the point 
p are zero. 


The map exp, makes it also possible to introduce geodesic polar coordinates in 
a small neighborhood of a point p € S as the image of polar coordinates (r, g) in 
the tangent plane at the point p under the map exp,. In geodesic polar coordinates 
the equation r = const determines a geodesic circle centered at p, and the equation 
gy = const determines a geodesic line passing through the point p. 

Let us calculate the coefficients of the first quadratic form Edr? + 2F dr dg + 
Gd¢? in geodesic polar coordinates. The velocity of motion along the coordinate r 
is 1, whence E = 1. The curve with natural parameterization s which is determined 
by the equation g = const is geodesic. For this curve, we have oe # Oand ae = 0; 
therefore, le = 0. 

In the case of surfaces, in which there are only two indices, the equations 
(rij,71) = ry, gk (see p. 85) are often more convenient than the general expression 
for the Christoffel symbols in terms of the metric. Let us write two such equations: 


1 1 
Ce + [iF = (r{1,71) = xP ee aes T2.G = (rj1,r2) = Fi -— 52 
(here E; and F are derivatives with respect to r and E> is derivative with respect 
to g). Since E = | and ie = 0, it follows from the first equation that Ti = 0 and 
from the second equation that F; = 0, i.e., F does not depend on r. 
Consider the geodesic circle a(g) centered at p and the geodesic line y(r) 


passing through p. At their intersection point, we have F = (4. a . Let us fix 
gy and consider the limit lim,_.9 F(r, g). The geodesic circle w(g) contracts to the 


point p; therefore, lim,_,9 (42. +) = 0. But F does not depend of r, whence 


F = 0. Thus, we have proved the following assertion. 
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Theorem 3.17 (Gauss’ Lemma) The geodesic exp,(tVo) from the center of the 
geodesic circle {exp,(V) | || V || = r} is orthogonal to this circle. 


HISTORICAL COMMENT In 1825 Gauss proved that the set of endpoints of arcs of 
the same length on geodesics from the same point of a surface is a curve intersecting 
all these geodesics at right angles. 


Thus, in geodesic polar coordinates, the first quadratic form is dr2+ G(r, g)dy”. 
It follows by Theorem 3.3 (see p. 76) that the Gaussian curvature K can be 
calculated by 


Icy? iG 
4G°2K = ({—) —~2G——. 
ar ar2 


A simple calculation shows that this formula is equivalent to 


1 2. 
r 


a/G 


or 
coordinates x! = rcosy, x* = rsing. Under a change of coordinates the area 


form is multiplied by the Jacobian determinant of this change; therefore, 


1 2 
JG =VEG- FP =VEG~ Poe) = EGP. 


d(r, ~) 


Let us prove that lim,_.9 G = 0 and lim,;_,9 = |. Consider the normal 


At the point p we have E = G = 1 and F = 0 for the normal coordinates; hence 
ave 24, 


lim,.9 G = 0 and lim,_.o +, 
Let us write Eq. (3.15) in the form 


2: 
uve __KVG (3.16) 


and differentiate it with respect to r: 


a3./ af K 
ONG gg NO ee 
ar3 or ar 


Hence 


eJ/G 
K(p) = li 
(P) bu ar3 
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Note also that (3.16) implies 


_ ?VG 
lim =0 
r>0 dr? 


Thus, we know all coefficients in the decomposition 


aV/G r? 02-/G r? PVG 
0,.~+—-—-0,9)+——% 
or 3! or 


JVG(r, 9) = VG, ¢) +r 
or 2! 


(0, y) +o(r?) 


and can write it in the form 
r 
VG(r, 9) = 1 —  K(p) + 007). 


This formula makes it possible to express the Gaussian curvature at a given point in 
terms of the lengths of geodesic circles of sufficiently small radius or in terms of the 
areas of geodesic disks of sufficiently small radius. 


Theorem 3.18 (Bertrand—Diguet—Puiseux) 
(a) Let L(r) be the length of a geodesic circle of radius r. Then 


Qar—L 
Kelim 2 oer LO. 
r>0 70 r3 


(b) Let A(r) be the area of a geodesic disk of radius r. Then 


2 
Coin 
r—>0 IT r4 


Proof 


(a) Let us introduce geodesic polar coordinates (7, g) in a neighborhood of a point 
p. Fix r and consider a geodesic circle a(g) of radius r; the point a(g) has 
geodesic polar coordinates (r, g). The length of the velocity vector of this curve 
at the point (7, g) equals IG (r, p); therefore, 


2n 
Lir) = Vv G(r, y)dy 
0 
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(b) Clearly, 


2n r 
Awy= faa f i VG(r, g)dr dg 
0 0 


2n r Kr 2n r 
= i (-- —) dray+ [ / o(r?)dr dg 
o Jo 6 0 Jo 


= r2 Kr‘ te (r4) 
= £70 > A OV"). 
Oo 


HISTORICAL COMMENT Theorem 3.18 was proved by Bertrand, C. F. Diquet, and 
Victor Alexandre Puiseux (1820-1883) in 1848. 


Geodesic polar coordinates can also be applied to prove that a geodesic locally 
minimizes arc length. 


Theorem 3.19 Any point p on a surface S has a neighborhood U such that the 
length of a geodesic y going from the point p to some point q and lying entirely 
in the neighborhood U does not exceed the length of any curve a on S joining the 
points p and q. Moreover, if the lengths of the curves y and a are equal, then y and 
a coincide as nonparameterized curves. 


Proof Consider the image of a neighborhood of the origin in the tangent space at the 
point p such that the restriction of exp, to this neighborhood is a diffeomorphism 
and let U be an open geodesic disk of radius ¢ lying entirely in this image. If g € U, 
then the geodesic circle centered at p and passing through q is contained in U. 
First, consider the case where the curve a(t) = (r(t), g(t)), t € [a, D], lies inside 
the geodesic circle. We have 


b b b 
L(a) =) V wp +ypar> f V(r')?dt >| r'dt = L(y), 


and the equality is attained if and only if ¢ = const and r’ > 0. In this case, the 
nonparameterized curves y and @ coincide. 

If the curve a intersects the geodesic circle, then its length is at least the radius of 
this geodesic circle, and the equality is possible only if the nonparameterized curves 
y and a@ coincide. oO 


3.16 Lines of Curvature and Asymptotic Lines 


At each point of a smooth curve on of a surface, three quantities are defined: 
the geodesic curvature k,, the normal curvature k,, and the geodesic torsion 2g. 
Moreover, the normal curvature and the geodesic torsion depend only on the 
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direction of the curve at the given point; they are the same for all curves with 
common tangent line. 

We have already discussed the curves for which kg = 0 at all points (geodesics). 
The curves for which k, = 0 (asymptotic lines) or x, = O (lines of curvature) 
are interesting in many respects as well. As we will see, asymptotic lines and lines 
of curvature differ substantially from geodesics: through any point we can draw a 
geodesic in any direction, while lines of curvature and asymptotic lines can be drawn 
in only two directions (moreover, asymptotic lines exist only at points of negative 
Gaussian curvature). The lines of curvature bisect the angles between asymptotic 
lines. 

A line of curvature is a smooth curve on a surface S such that the direction 
of its tangent at each point corresponds to one of the principal curvatures of the 
surface at this point. The lines of curvature are integral lines for the two direction 
fields corresponding to the directions of principal curvatures. At the nonumbilical 
points (i.e., at those points at which the principal curvatures are different) the lines 
of curvature intersect at a right angle. 

On the sphere and in the plane any curve is a line of curvature, because all points 
of the sphere and the plane are umbilical and all directions are directions of principal 
curvature. 

Rodrigues’ formula shows that the eigendirections of the differential dn, where n 
is the normal vector to the surface, are the principal curvature directions. It follows 
that the lines of curvature are the curves consisting of points at which the geodesic 
torsion x, vanishes. Indeed, recall the third formula for the Darboux frame: £3 — 
—kné| — 2&2; here €3 = n and ¢€| is a tangent vector to the curve. Thus, a tangent 
vector to a curve is an eigenvector of the operator dn if and only if x. = 0. 


Problem 3.29 Prove that the parallels and meridians of a surface of revolution are 
lines of curvature. 


Problem 3.30 Prove that if two surfaces intersect along some curve y at a constant 
angle and this curve is a line of curvature on one of these surfaces, then it is also a 
line curvature on the other surface. 


If K(p) < 0, then there are two directions at the point p for which the normal 
curvature k, is zero (in other words, the second quadratic form vanishes in these 
directions). These directions are said to be asymptotic. They divide the tangent plane 
into four sectors, in each of which k, has constant sign. On a surface of negative 
Gaussian curvature asymptotic lines are defined, whose tangents have asymptotic 
directions. Clearly, if a surface contains a straight line, then this line is asymptotic. 


Problem 3.31 Prove that, for an asymptotic line, the Darboux frame coincides with 
the Frenet—Serret frame (under an appropriate choice of the direction of the normal 
vector to the surface). 


Problem 3.32 Prove that the asymptotic directions at a point p are perpendicular if 
and only if the mean curvature vanishes at this point. 
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Both the lines of curvature and the asymptotic lines can be the coordinate lines 
of some parameterization in a neighborhood of a given point under the natural 
additional assumption that this point is nonumbilical (for lines of curvature) or is 
a point of negative Gaussian curvature (for asymptotic lines). Both these assertions 
about the existence of a parameterization follow from a general theorem, which we 
will now prove. 


Theorem 3.20 Suppose that on a smooth surface r(u, v) in a neighborhood of a 
point with coordinates (uo, v9) two differential equations 


Ai(u, v)du+ Bi(u,v)dv=0, Adu, v)du + Bo(u, v)dv =0 


are given, and their coefficients satisfy the condition A, By — A2B, 4 0 at the point 
(ug, vo). Then in a neighborhood of this point the surface can be parameterized so 
that the coordinate lines are integral curves for these equations. 


Proof We assume that A2B, 4 0. Let v = g(a, u) be the solution of the first 
equation with initial condition g(a, uo) = a, and let u = w(, v) be the solution of 
the second equation with initial condition (6, v9) = 6. The equations v = g(a, u) 
and u = w(, v) are solvable with respect to a and 6 in a neighborhood of the point 
(ug, v9), because i = | atu = uo and A = latv = vo. Leta = au, v) and 
B = Bu, v) be their solutions. We claim that a, 6B, — a8, 4 0 at the point (uo, vo). 
Indeed, the curve a(u, v) = const is an integral of the first equation, and hence the 
equations A;du + Bidv = O and a,du + a,dv = 0 are proportional, i.e., Ajay = 
Bay, a, # 0. Similarly, A2By = B2By, By 4 0. Suppose that a, By = ay B,. Then 
A2Biay By = A, Bray By = A, Bray By, and a, By ~ 0. Therefore, A; By = A2B}, 
which contradicts the assumption of the theorem. Thus, a8, — a,f, 4 0 at the 
point (uo, vo), and hence a(u, v) and B(u, v) can be taken for new parameters of 
the surface. For these parameters, the coordinate lines w = const and 6 = const are 
integral lines of the equations under consideration. Oo 


The system of equations in Theorem 3.20 can be given in the form of the product 
of these equations, i.e., as an equation of the form 


Adu? +2Bdudv+Cdv* =0. 


The coefficients in this equation must satisfy the inequality AC — B* < 0. 
The asymptotic lines are the solutions of the differential equation 


Ldu? +2Mdudv+Ndv’ =0, 


where L, M, and WN are the coefficients of the second quadratic form. Moreover, at 
a point with negative Gaussian curvature, we have LN — M? < 0. 
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The direction (du : dv) is the direction of one of the principal curvatures if the 
vectors 


E F\ (du\ _ (Edu+Fdv anal LM\(du\ _ (Ldu+Mdv 
FG) \dv)” \Fdu+Gdv MN) \dv})” \Mdu+Ndv 
are proportional, i.e., 


Edu+Fdv Fdu+Gdv 
Ldu+Mdv Mdu+WNdv 


This equation can be written in the more symmetric form 


E F G 
L M N/=0. 
du* dudv dv* 


In a neighborhood of a nonumbilical point it has the form 
Adu” +2Bdudv+Cdv* =0, 


and through each point of this neighborhood precisely two integral curves pass. 
Therefore, AC — B* < 0, and we can apply Theorem 3.20. 

In the parameterization r(u, v) whose coordinate lines are lines of curvature 
many expressions become simpler. For example, in these coordinates the first and 
second quadratic forms are E du2 + G dv? and ky E du2 + koG dv’, where k, and 
kz are the principal curvatures. Indeed, it follows from the orthogonality of the 
curvature lines that F = 0, and the coefficients of the second quadratic form are 
obtained by applying Rodrigues’ formula. The Codazzi—Mainardi equations become 
substantially simpler in these coordinates as well, because M = 0: 


Ly = LT},-—NT?,, Ny = NT}, — LT}. 


Here L = kj E and N = kyG. We also have 


2 
M=—-7q@ Tres 7p bn=-FR 


Thus, the Codazzi—Mainardi equations take the form 


E G 
(ki E)y = a +k), (keG)y = a + ky). 
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3.17 Minimal Surfaces 


A minimal surface § C R? is a surface with zero mean curvature. 

To explain the name, consider the normal variations of the surface. Let D be 
a bounded domain of the surface, and let D be its closure. Given a differentiable 
function h: D — R, the normal variation of the surface r(u, v) determined by 
the function h is the family of surfaces r;(u, v) = r(u, v) + th(u, v)n(u, v), where 
n(u, v) is the unit normal vector to the surface at the point r(u, v). 

Recall that the area of a surface is given by the integral f VEG — Fdu dv (see 
formula (3.1) on p. 66). Therefore, to evaluate the area of the surface r; for small f, 
we calculate the first quadratic form of r;. To this end, we first calculate the partial 
derivatives (7;)y = oe and (r;)y = ot: 


(hu =y t+ thn, + thyn, 
(ry)y =ly + thny + thyn, 


The coefficients of the first quadratic form of the surface r; are equal to the pairwise 
inner products of these partial derivatives: 


E, = E + 2th(ry, ny) + t7h? (ny, My) + thy hy, 
Fy = F +thl(ru, nv) + Cvs tu) + Py, ny) + Phulry, 
G; = G+ 2th(ry, ny) + 17h? (ny, ny) + Phyhy. 

If we are interested in the coefficients of the first quadratic form of r; up to the 
first order in ¢, then we can express them in terms of the coefficients of the first and 
second quadratic forms of r, because 

(TusMu) = —L,  (u,Nv) + v, Nu) = —2M, (Ty, Ny) = —N. 
Therefore, 


E,G; — F? = EG — F* — 2th(LG — 2MF + NE) + o(t). 


Recall that the mean curvature H is expressed in terms of the coefficients of the first 


and second quadratic forms as H = 5 : peek (see Theorem 3.2 on p. 75). 
Hence 


E,G, — F? = (EG — F’)(1 — 4th) + o(t). 
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For the area A; of a normal variation of the domain D, we obtain the expression 


Aw =f JEG, — Fedu dv 
z [ /1—4thH + oVEG — F2du dv. 
Therefore, 
A'(0) = — [ 2H VEG = Pade. 


This formula shows that a surface is minimal if and only if A’(0) = 0 for any 
normal variation of any domain of this surface. Indeed, if a surface is minimal, then 
H = Oat all of its points. Suppose that H(p) 4 0 at some point p and consider 
a function / vanishing outside a small neighborhood D of p and taking values of 
the same sign as h(p) = H(:p) at those points at which it does not vanish. We have 
A’(0) < 0 for the variation determined by the function h. 


HISTORICAL COMMENT The equivalence of the minimality of the area of a surface 
with given boundary to the vanishing of the mean curvature of this surface was 
proved by Meusnier in 1776. 


In the differential geometry of surfaces it is sometimes convenient to use 
isothermal coordinates, for which the angles between curves on the surface are 
equal to the angles between the corresponding curves in the coordinate plane, 
i.e., the map of the coordinate plane to the surface is conformal (preserves the 
angles between curves). For isothermal coordinates, the first quadratic form is 
7 (u, v)(du? + dv’). 


HISTORICAL COMMENT The existence of isothermal coordinates in the case of a 
real analytic metric was proved by Gauss in 1822. 


In isothermal coordinates with first quadratic form 47 (du? + dv~) the expression 
for the mean curvature takes the form H = Ltn where L and WN are the coefficients 
of the second quadratic form. Using this expression for H, we can prove the 
following stronger statement (from which the above formula is obtained by taking 


inner product with the normal vector). 


Theorem 3.21 Let r(u,v) be an isothermal parameterization. Then ryy + ryy = 
222 Hn, where n is the normal vector. 


Proof Differentiating the equation (7,,7r,) = (’y,1v) with respect to u and the 
equation (r,, 7y) = 0 with respect to v, we obtain 


(uu, Tu) = uv, Tv) = —Cus Tov): 
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Therefore, (Fyn + vv, ’u) = 0. Similarly, (run + rov, vy) = 0. Hence the vector 
ruu + Pvy is proportional to n. The proportionality coefficient satisfies the equation 


H= a , because 


(nN, Pun + vv) = L+N = 207H. 


Thus, ruxn troy = 202 Hn. oO 


For a function f (u, v), the Laplace operator (Laplacian) Af is defined as 
a> fF af 


Af =—>~+—. 
f au2 " dv2 


A function f is said to be harmonic if Af = 0. Theorem 3.21 readily implies the 


following description of minimal surfaces. 


Corollary Let (x! (uw, v), x7 (u, v), x3(u, v)) be an isothermal parameterization of 
a surface. Then this surface is minimal if and only if the functions x'(u, v), x*(u, v), 
and x3(u, v) are harmonic. 


Example 3.6 A surface of the form 
r(u, v) = (acoshucosu, acoshv sinu, av) 


is called a catenoid. A catenoid is a minimal surface. 
Proof It is easy to check that E = G = a? cosh? v, F = 0, and ry, +ryy =0. O 


Problem 3.33 Prove that any minimal surface of revolution different from the plane 
is a catenoid. 


Example 3.7 A surface of the form 
r(u, v) = (asinhucosu, asinhvusinu, au) 


is called a helicoid. A helicoid is a minimal surface. 
Proof It is easy to check that E = G = a’ cosh* v, F = 0, and ry, +ryy =0. O 


HISTORICAL COMMENT Catenoids and helicoids were discovered by Meusnier in 
1776. 


Problem 3.34 Prove that the the spherical Gauss map of a minimal surface with 
nonvanishing Gaussian curvature is conformal. 
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3.18 The First Variation Formula 


In studying variations of curves on a surface S, it is convenient to consider piecewise 
smooth curves and their variations. A piecewise smooth curve is a map y: [0, 1] — 
S which defines smooth curves on intervals [#;_1, t;] for a partition 0 = tf) < th < 

- < t = 1 of the interval [0, 1]. The points p = y(O) and g = y(1) are the 
endpoints of the curve. 

A variation (with fixed endpoints) of a curve y(0,t) = y(t) is a family of 
curves y(s,t), s € (—e,€), t € [a,b], i.e., a map (—e, €) x [0,1] — S. Here s 
parameterizes the family of curves and ¢ is a parameter on each curve. For a fixed 
s, we obtain a curve in the family, and for a fixed t, we obtain a curve transverse 
to the family of curves. The endpoints of the curves are fixed, i.e., y(s,0) = p and 
y(s, 1) = q for all s. The restriction of the map to each strip (—é, €) x [tj-1, ti] is 
smooth. 

With each variation two vector fields are associated, which correspond to the 


partial derivatives with respect to ¢f and with respect to s: ov (st) is a tangent vector 
dy(s,t) 
os 


to a curve in the family and 
we will call a variation vector. 


is a tangent vector to a transverse curve, which 


Problem 3.35 Let W(t) be the restriction to a curve y(t) of some vector field 
tangent to the surface S. Prove that there exists a variation of y with variation vector 
W(t) at each point of y. 


A curve y(t) is said to be critical for a functional F if 


dF (y(s, t)) 
ds ees 


for any variation of this curve. 

The geodesics are critical curves for the length functional. For a parameterized 
curve y, the length functional equals L(y) = de V(y', y’)dt. In some respects 
the length functional is not as convenient as the energy functional E(y) = 
5 Lo ,y’)dt. The point is that the length functional does not depend of the 
choice of a parameterization, while the energy functional does. Therefore, for the 
energy functional, the parameterization of a critical curve is determined up to 
proportionality, while for the length functional, the parameterization of a critical 
curve can be arbitrary. This leads to the strong singularity of the length functional, 
which is a hindrance in some situations. 


Theorem 3.22 For a curve parameterized by an interval [a,b], the inequality 
L(y)? < 2(b — a)E(y) holds, which becomes an equality only for a curve whose 
parameterization is proportional to arc length. 


112 3 Surfaces in Space 


Proof Let f(t) = J(v'(t), y(t). According to the Cauchy—Bunyakovsky— 
Schwarz inequality, we have 


b 2 b b 
Loy =(f f(s) 1dr) <|/ Pat: f ldt = 2E(y)(b—a). 


The equality is attained only for f(t) = const. Oo 


Let us determine what piecewise smooth curves are critical for the energy 
functional. Consider a variation y(s,t) of a curve y(t). We use the following 
notation: 


W(t)= es ; is the variation vector; 
s= 


V@)= apt is the velocity vector of the curve; 
A(t) = Vy V is the acceleration vector of the curve; 
A,V = V(t + 0) — V(t — 0) is the jump of the velocity vector at a point ¢. 


Theorem 3.23 (First Variation Formula) The derivative BEE) 
S 


equals 
=0 q 


1 
-Ywo.av- f (W(t), A(t))dt. 
t 


Proof According to Theorem 3.13 on p. 96, we have 


ad [dy dy dy oy 
Baas ee es he ee 
~(%.2) (Vorare, 2 


Therefore, 


dE(y(s.t)) 10 fl (ay a a 
BED) 18 farm f (vyardt 2) a 
as 2as Jo \at’ at 0 or’ or 


Using the symmetry of the Christoffel symbols, we replace Vaya — in the last 
integral by 


The identity 
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makes it possible to perform integration by parts in the form 


ti ay ad (2% xr [ (2 ) 
Vy dt = _ —, V,'— | dt. 
is ( as’ at OS.” Ob) Veen uel: fied NON” OE 


Summing these expressions overi = 1, ..., k and taking into account the vanishing 
of the vector >— oy ; att =O andt = 1, we ohiain 
k-1 1 
dE(y(s,t)) OY OY / dy | ay 
= —,A,;—)- , Vy! dt. 3.17 
as 2 (5; i Or oe Nae ene tp 
i= 


Substituting s = 0, we arrive at the required expression 


JE(y(s, t)) 
Os 


1 
=-—)((WW), AV) - i, (W(t), A(@))dt. 
t 


s=0 


Oo 


Recall that a geodesic (with parameterization proportional to the natural one) is 
a smooth curve with zero acceleration A(t). 


Corollary A curve is critical for the energy functional if and only if this curve is 
geodesic. 


Proof The second term in the expression for the variation of the energy functional 
shows that a variation in the direction of the acceleration vector A(t) reduces E. 
Therefore, a critical curve is geodesic on each interval of smoothness. The first term 
in the expression shows that at the endpoints of these intervals a variation W of the 
curve in the direction of A;V reduces E, and hence a critical curve is smooth, that 
is, has no sharp corners. Oo 
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For functions of many variables, a critical point is not necessarily a point of local 
minimum or maximum. The condition that a point is critical is necessary but not 
sufficient for local minimality. A sufficient condition for a function to have local 
minimum is the positive definiteness of its Hessian matrix. 

Not all geodesics are shortest curves joining given points. Through any two 
points of the sphere which are not diametrically opposite infinitely many geodesics 
of different lengths pass, and only one of these geodesics is shortest. To distinguish 
the shortest geodesic among the other ones, we need a sufficient minimality 
condition. It is this purpose that the second variation formula for the energy 
functional serves. This formula applies only to critical curves, that is, geodesics. 
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ee a two-parameter varianion y(s1, 52, t) of a geodesic y(t) = y(0, 0, f). 
Let 30, 0,t) = Wi(t) and oy ~(0, 0,7) = Wa(t) be the variation vectors. The 
Hessian determinant E,,,(W}, W,) of the energy functional is the second derivative 


051 0s2 (s1,82)=(0,0) 


For brevity, we denote this second derivative by a a £ a (0,0). We also use the 


notation V = a = y’ for the velocity vector of the curve y and set 


ArVy Wi = Vy Wilt =o 0) _ Vy Wilt — 0). 


The second variation formula involves the Riemann curvature tensor R(X, Y)Z. 
We begin with a remark, which we will need shortly. The vector fields Wy = ah, 
W2= es and V = oy pairwise commute, because they are induced by commuting 
(according to Problem 3.27) coordinate vector fields via the map (s1,52,t) > 


y (s1, 52, t). Therefore, 


Vw, V = VvWi, (3.18) 


R(V, Wi)V = Vv Vw, V — Vw, Vv V = (Vv)? — Vw, Vv. (3.19) 


Theorem 3.24 (Second Variation Formula) For a gee oe alia variation of 


a geodesic y(t) = y(0,0,t), the second derivative wk os (0,0) of the energy 
functional equals 


1 
— Domo. avy = fo (wa. cw — RW, wv) di 
0 
t 
Proof Let us apply Eq. (3.17) obtained in deriving the first variation formula: 

OE 0 0 '79 0 

a > (3. ac) -[ (=. vy) dt. 

0s2 : 052 ot 0 \ds2 ot 


Differentiating with respect to s;, we obtain 

a°E dy . oy ay dy 
=— y Vay /as, —, Ar— |) — , ,Vv A 

a51 052 ; ( ete aan ) ig ae t) 


1 1 
dy dy : dy dy 
= Vine. ae Vy, Volek 
i ( es oe ) Gg Kae Oe Se 
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We must evaluate this quantity at the point (sj, 52) = (0,0). By assumption the 
curve y(t) = y(0, 0, t) is geodesic (has no sharp corners). Therefore, at the point 
(0, 0), 


Thus, the first and the third term vanish, and there remain only the second and 
the fourth one. Transforming them, we obtain the following equation, in which the 
notations sX(0, 0, t) = W(t), sX(0, 0, t) = W2(t), and V = ats = y’ are used: 


ay 1 
i Wo, Vay/as; Ars} — [| (We, Vayyas, Vv V) dt. 
Fo v= =a 25 Vdy/dsy i) ie 2, Vay/as; VV ) 
Equation (3.18) implies 


Voy jac, Aree = ApVy oe = AVyW 
abn Pe = Oy Gg = ary Mi 


It remains to check that 


Vay/as, WVV = (Vy)>W — R(V, Wi)V. 


This follows directly from (3.19). oO 


The second venation formula shows that the Hessian determinant 
E,,.(W,, W2) = ms i (0, 0) is a symmetric bilinear function in W, and W) (its 
symmetry follows from the symmetry of the second derivative). 

With each symmetric bilinear form a quadratic form is associated. In particular, 
the quadratic form associated with the bilinear form E,,.(W1, W2) is Ex.(W, W). 
This quadratic form can be expressed in terms of the one-parameter variation y (s, ft) 
with variation vector W without resorting to a two-parameter variation as 


d°E(y(s,t)) 


Ey. (W, W) = a2 : 
s=0 


Indeed, consider the two-parameter variation y (s1, 52, t) = y(s1 + 52, t). We have 


a 2E t a7 
av(81921) _ 8Y69 _ wey Ga y.p, PEN) _— PEYOD) 
0S; os 051082 as2 


For a minimal geodesic y, the quadratic form F,.(W, W) is nonnegative definite, 
because the Hessian matrix is nonnegative definite at a point of minimum. 
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3.20 Jacobi Vector Fields and Conjugate Points 


A vector field J given on a geodesic y is called a Jacobi vector field if it satisfies 
the Jacobi differential equation 


(Vy) J — R(V, JV =0, 


where V = y’ is the velocity vector. This is a second-order linear differential 
equation in two variables, and it has four linearly independent solutions. Each Jacobi 
vector field is completely determined by the initial conditions J(0) and Vy J(0). 
Recall that the expression (Vy)*W, — R(V, W,)V occurs in the second variation 
formula. 

Let p = y(a) and gq = y(b) be two points of a geodesic y for a # b. These 
points are said to be conjugate along y if there exists a nonzero Jacobi vector field 
J along y which vanishes at t = a and at t = b. The multiplicity of p and q 
as conjugate points equals the dimension of the vector space spanned by all such 
Jacobi fields. 


HISTORICAL COMMENT In 1838 Carl Gustav Jacob Jacobi (1804-1851) intro- 
duced the notion of a point conjugate to a point of a geodesic and stated conditions 
for a geodesic to be shortest. 


Consider the kernel of the form E,.,, which consists of all vectors W; such that 
E,,.(W,, W2) = 0 for all Wo. 


Theorem 3.25 A vector field W, belongs to the kernel of the form Ex if and only 
if W, is a Jacobi vector field. 


Proof Consider a Jacobi vector field J vanishing at points p and q. Let us apply the 
second variation formula: 


1 
Eax(J, Wo) = — D> (Walt), A:Vv J) i (We, (Wv)?F = RV, DV) dt. 
t 


In the case under consideration, A; Vy J = O (J is smooth as a solution of a 
differential equation) and (Vy)2J—R(V, J)V = 0(J satisfies the Jacobi equation); 
therefore, 


1 
Ex.(J, Wo) = =o.) f (W2, 0) dt = 0, 


t 0 


i.e., J belongs to the kernel of E,,.. 
Conversely, suppose that W; belongs to the kernel of E,,,.. Let us partition the 
interval [0, 1] by points 0 = f9 < ty) <--- < t, = 1 so that the vector field W, is 
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smooth on each of the intervals [t;1, ¢;]. Consider a smooth function f: [0, 1] > 
[0, 1] vanishing at the points fo, t1, ..., f% and positive at all other points. We set 


W(t) = fo) (WWW — RV, WV): 


of course, V and W; also depend on ft. We have 


—Exe(W1, W2) = 20+ ib fr |owvy?m — RLV, WV | at. 


This quantity equals 0; therefore, the restriction of W; to each of the intervals 
[t;-1, t;] is a Jacobi field. 

Now consider a vector field W, such that W5(¢;) = A;, Vv Wi fori = 1, 2,..., 
k — 1. We have 


k-1 1 
= Bui WS) = [dn Mil? + f Odt =0; 
i=l . 


this means that Vy W, has no jumps. But the solution W; of the Jacobi equation 
is completely determined by the vectors Wj (¢;) and Vy Wj (t;). Hence the k Jacobi 
vector fields (restrictions to the intervals [t;—1, t;]) are compatible with one another 
and can be glued together so as to form a smooth vector field on the interval [0, 1]. 

| 


Thus, the dimension of the kernel of E., equals the multiplicity of p and 
q as conjugate points. The dimension of the space of Jacobi fields vanishing at 
t = 0 equals 2. Therefore, the multiplicity of conjugate points is not higher than 2. 
Let us show that, in fact, the multiplicity is at most | (and therefore the notion 
of the multiplicity of conjugate points makes sense only on many-dimensional 
Riemannian manifolds). To this end, it suffices to construct an example of a Jacobi 
field which vanishes at t = 0 and does not vanish at t = 1. We set J(t) = tV (ft), 
where V(t) = y’(t) is the velocity vector of a geodesic. We have 


VvyJ=1-V+tVvV=V, 
because Vy V = 0. Hence 
(Vy) J = VyV =0. 


Moreover, R(V, J)V = tR(V, V)V = O, because the Riemann curvature tensor 
is skew-symmetric with respect to the first two variables. Thus, J(t) = tV(t) isa 
Jacobi field with J(0) = 0 and J(1) £0. 

Jacobi fields can be obtained by using variations with loose endpoints in the 
class of geodesics. Let y(s, t) be a family of geodesic, i.e., a family in which each 


118 3 Surfaces in Space 


curve y (so, t) is a geodesic; we do not fix the endpoints of these geodesics. Then 
the variation vector field J(t) = ura (0, t) is : Jacobi field. Indeed, since each curve 


y (So, t) is a geodesic, it follows that Vay /at oe a7 ’(s,t) = 0. Therefore, 


0=V V —=+=V V —R os 
dy/ds Voy/ot at dy/dt Yoy/ds Z_ at (=. Ds. 


Moreover, Vay /as = Vay (at xe whence 


dy 207 
Vay /at Vay /as 5 = (Vay at) ae 
Thus, at s = 0 we have 
(Vy) J — RIV, J)V =0, 


as required. 

The converse is also true: any Jacobi field can be obtained by a variation in the 
class of geodesics. But we will not prove this (for a proof, see, e.g., [Mi4], p. 81). 

If e1, e2 is an orthonormal basis, then in this basis the matrix of the first quadratic 
form is the identity, and hence Rae = —K and R(e1, e2)e) = Rii€2 = —-Kep. 
Suppose given a geodesic with natural parameterization; let e} = V be its velocity 
vector, and let e2 be the unit vector orthogonal to it. Finally, let J be a vector field 
orthogonal to the geodesic, i.e., J = ye. The vector field e; is parallel along the 
geodesic, and hence so is the vector field e. Therefore, Vy J = y’e2 and (Vy)2J = 
ye. Thus, for the vector field ye2, which is orthogonal to the geodesic, the Jacobi 
differential equation has the form (Vvy)2J + KI =0,ie., 


y’+Ky=0. 


The Jacobi equation for J = xe; + yez can be written in coordinates as x” = 0 
and y” + Ky = 0. Conjugate points are associated only with Jacobi vector fields 
for which x = 0, i.e., with vector fields orthogonal to the geodesic. Clearly, if 
y(t) is a solution of the equation y” + Ky = 0, then, for any constant c, cy is a 
solution of this equation as well. Therefore, in studying conjugate points, it suffices 
to consider solutions of the equation y” + Ky = 0 with initial conditions y(0) = 0 
and y’(0) = 1 in the domain t > 0. 

If K < 0 on the entire geodesic, then this geodesic contains no conjugate points. 
Indeed, in this case, y’(t) > 0, and hence y’(t) = 1+f y”(t)dt > 1, which means 
that the function y(t) monotonically increases and cannot vanish at tf > 0. Thus, 
conjugate points can exist only if the Gaussian curvature is positive at some points. 
If, in addition, the Gaussian curvature is bounded by a positive constant from below 
or above, then we can estimate the distance between neighboring conjugate points 
from above or below. For this purpose, we can use Sturm’s theory, which makes it 
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possible to estimate distances between neighboring zeros of solutions to equations 
of the form y” + Ky =0. 


Theorem 3.26 (Sturm) Suppose given two differential equations 
y’+a(x)y=0, 2’ +b(x)z=0 

in the interval [0, x1], where a(x) < b(x) in the entire interval; let y(x) and z(x) 

be solutions of these equations for which y(0) = z(0) = 0 and y'(0) = z’(0) = 1. 


Then y(x) = z(x) for x > 0 if both functions y and z are positive on the half-open 
interval (0, x]. 


Proof Clearly, y’z—yz" = (b(x)—a(x))yzand (y’z—yz’)’ = y"z—yz"; therefore, 


yz—-yz = i (b(x) — a(x))yzdx > 0. 
0 


This means that (y/z)’ > 0. 


According to L)H6pital rule, we have lim,_, 7 on = lim,-s9 a = 1; hence 
y(x) > z(x) for x > 0. oO 
Corollary 


(a) If z(x,) = 0 for some positive x, and the function z has no zeros between 0 and 
x1, then y(x) 40 for0 <x < x}. 
(b) If y(x1) = 0 for some positive x, then z(x2) = 0 for some positive x2 < x}. 


HISTORICAL COMMENT Jacques Charles Francois Sturm (1803-1855) developed 
comparison theory for solutions of differential equations in 1836. 


Theorem 3.27 Let y(t) be a geodesic with natural parameterization, and let K (t) 
be the Gaussian curvature at y (t). Suppose given a positive number K. 


(a) If K(t) < K forall t, then no point y(t), 0 < t < 2/K, of the geodesic is 
conjugate to the point y (0). 

(b) If K < K(t) forall t, then the geodesic has a point y(t), 0 < t < 2/VK, 
conjugate to y (0). 


Proof 


(a) We set a(t) = K(t) and b(t) = K and apply assertion (a) of the corollary to 


Sturm’s theorem. The solution z(t) equals K sin(t/ K); therefore, y(t) does 


not vanish for 0 < t < SE 

(b) We set a(t) = K and b(t) = K(f) and apply assertion (b) of the corollary to 
Sturm’s theorem. The solution y(t) equals kz sin(t/K ); therefore, z(t) = 0 
for some positive t < -E. 


JK 
Oo 
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A small enough part of a geodesic is a shortest curve joining given points (see 
Theorem 3.19 on p. 104). But if a part of a geodesic contains conjugate points, then 
it does not have this property. 


Theorem 3.28 Let y(t), t € [a,b], be a geodesic, and let y(a) and y(t), a < 
t <b, be conjugate points. Then y is not a shortest curve joining the points y (a) 
and y (b). 


Proof Let J(t) be a nonzero Jacobi vector field on the geodesic y(t) for which 
J(a) = J(t) = 0. Consider the vector field J (t) on y(t) which coincides with J (t) 
fora < t < t and vanishes fort < ¢ < b. At the point y(t) the jump A,VvJ — 
Vv J(t) of the velocity vector is nonzero. Indeed, otherwise, since J(t) = y(t)e2(t) 
and Vy J(t) = y'(t)e2(t), the equations y(t) = y’(t) = 0 would imply that the 
vector field J(t) must be zero. 

Let us construct a vector field X(t) on the curve y(t) so that X(a) = X(b) = 0 
and (X(t), A; Vy J) = | and consider the vector field W = is — cX. We have 


1 a 7 
Evx(W, W) = EB wx(S, J) — 2E a(S, X) + c Ex (X, X). 
It follows from the second variation formula that 
Exx(W, W) = 0 — 2(X(t), ArVy J) +c? Exx(X, X) 
94 BX), 


If c is small enough, then E,,.(W, W) < 0. 
Take a vector field W for which E,,,.(W, W) < 0 and consider the variations 


a(s,t) = EXP, (7) (s W(t), 
B(S1, 52, t) = a(s1 + 5, t) = exp, (C51 + 52) W(t). 
We have 


dE(a(s,t aE oat 

SEG )) SECC, D) = Eg(W, W) <0. 
ds |s=0 451952 (s1.82)=(0.0) 

Therefore, the map s + E(a(s, t)) has a strict local maximum at s = 0. Thus, the 

curve y cannot be a strict local minimum for the functional EF. oO 


It follows directly from Theorems 3.27 and 3.28 that if the Gaussian curvature 
of a surface is not smaller than a given positive number K at each point, then the 
shortest length of a geodesic on this surface is at most wiae 

Recall that a metric space is complete if it contains all of its limit points. An 
example of a complete metric space is a surface without boundary embedded in R? 
as a closed subset. 
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There are points on a surface with boundary or on a surface minus a point 
that cannot be joined by a geodesic. But on a surface without boundary, which is 
complete as a metric space, any two points can be joined by a geodesic. This follows 
from the Hopf—Rinow theorems, which we discuss in Sect. 5.4 (see Theorem 5.5 on 
p. 175 and Theorem 5.7 on p. 177). As a result, we obtain the following theorem. 


Theorem 3.29 (Bonnet) Jf the Gaussian curvature of a complete surface without 

boundary is not smaller than a fixed positive number K at each point, then this 

surface is compact. 

ae 

VK’ 

Therefore, the distance between any two points of the surface does not exceed Tre 
oO 


Proof If the geodesic is longer than then this geodesic is not minimal. 


HISTORICAL COMMENT Bonnet proved the theorem on the maximum length of 
a geodesic on a surface whose Gaussian curvature is bounded below by a fixed 
positive number at each point in 1855. 


To conclude this section, we give one more description of conjugate points: 
points p and exp,(V) on a geodesic y(t) = exp,(tV) are conjugate if and only 
if V is a critical point of the map exp,,. Before proving this assertion, we explain 
it, because it is concerned with such an intricate object as the tangent space to the 
tangent space. The vector V is a point of the tangent space at p. At the point V the 
differential of exp, is defined, which is a linear map from the tangent space at V to 
the tangent space at exp,(V). The point V is a critical point of exp, if this linear 
map of tangent spaces is degenerate, i.e., some nonzero vector X in the tangent 
space at V is mapped to zero. 


Theorem 3.30 Points p and exp,(V) on the geodesic y(t) = exp, (tV) are 
conjugate if and only if V is a critical point of the map exp,. 


Proof First, suppose that V is a critical point of the map exp, and a nonzero vector 
X in the tangent space at V is mapped to zero. Let us represent X as the velocity 
vector of a curve V(s) in the tangent space at p: V(0) = V and V’(0) = X. 
Consider the family of curves exp, (¢V(s)); for fixed s, each of these curves is 
geodesic, i.e., this family is a variation of the geodesic y(t) = exp,(tV) in the 
class of geodesics. Therefore, W(t) = #|,—o exp, (tV(s)) is a Jacobi field along 
the geodesic exp pit V). Clearly, W(0) = 0 and 


0 
Wi)= a exp,(V(s)) = (exp,)«(V'(0)) = (exp,)«(X) = 0. 
0 


s= 
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It remains to check that the vector field W(t) is not identically zero. To this end, we 
calculate its covariant derivative at t = 0: 


V,'W(0) = V,’ 


) 
10 55 OPPS) 


= Va/as|,— ~ exp, (V(s)) = Vayas|,_9 V(S)- 
The last expression is the covariant derivative of the vector field s +> V(s) along 
the constant curve s +» p. When the curve contracts to a point, the covariant 
derivative of the vector field transforms into the usual derivative with respect to 
s. Thus, V,,W(0) = V’(0) = X £0. 

Now suppose that V is not a critical point of the map exp,. Take a basis X1, X2 
of the tangent space at V; then exp,,(X1), exp,.(X2) is a basis of the tangent space 
at exp,(V). Let us represent the vectors X; and X2 as the velocity vectors of curves 
V\(s) and V;(s) in the tangent space at p such that V;(0) = V and V; (O) = X; and 
consider the variations exp p (t Vi (s)) and the corresponding variational fields W; (t). 
Clearly, W; (0) = 0, and the vectors W; (1) = (exp p(X i) are linearly independent; 
hence their nontrivial linear combinations cannot vanish at t = 1. The dimension of 
the space of Jacobi fields vanishing at t = 0 equals 2, and therefore the Jacobi field 
on the geodesic exp, (tV) cannot vanish both at t = 0 and at t = 1. oO 


3.21 Jacobi’s Theorem on a Normal Spherical Image 


The principal normal indicatrix (also known as the normal spherical image) T of a 
space curve C is the set of heads of the principal normal vectors (with the same tail) 
to this curve. Recall that a principal normal vector is the vector e2 in the Frenet— 
Serret formulas. 


Theorem 3.31 (Jacobi) Jf the principal normal indicatrix of a curve is a regular 
curve without self-intersections, then it divides the unit sphere into two parts of the 
same area. 


Proof Lets and s be the natural parameters of the curves C and I. It follows from 


the Frenet—Serret formula e, = —ke + xe3 that (43)° = Iles || = k? + x? and 
den de ds ds 
ee i) eae a 3.20 
rae ar Tie aT oe) 
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Differentiating this equation and taking into account the two other Frenet—Serret 
formulas, we obtain 


@ d's 2 d 2 
cee kei + xes)— + (Ker + x"e3) ~(P+%%)e() - 
ds ds 
(3.21) 
here k’ = & and x! = = Be 


Let us ere the geodesic curvature kg of the curve I on the sphere. Let ¢1, 


€2, €3 be the Darboux frame of this curve. The curve I" is traced by the head of the 


vector e2, and therefore e; = a, The vector e2 is the normal vector to the sphere; 


hence €3 = e2 and €2 = e2 x a2. Thus, 


dey _ dé 


rey = ae = Kgé2 + kyé3. 


Using (3.20), we obtain 


i dey der dey 
=(e,—)]= x= 
8 marie OS a as? 

;: n d?ey\ ds _ ious de ds 
={- xe xe2 X €3, — el, ‘i 
€2 X e] 2X €3, > | Ge = Res + MeL Te — 


Now Eq. (3.21) gives 
- ds\> x'k—-Kx ds d d 
kg = (a'k — k’x) a eae a =— (arctan =). = 

ds k2+x2 ds ds k 


whence keds = £ (arctan +) ds. 
As aresult, we obtain 


= a2 d 
/ Keds = [= (arctan = —)ds= = 0, 
Pr cds 


because the curve C is closed and k does not vanish. 

The rest of the proof is quite simple. Let R be one of the two domains of the 
sphere bounded by the curve I’, and let A be its area. The Gaussian curvature K of 
a sphere of radius 1 equals 1, so that the Gauss—Bonnet formula yields 


an = | kas+ f Kaas | dA=A. 
r R R 


HISTORICAL COMMENT Jacobi proved the theorem on a normal spherical image 
in 1842. 


| 
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3.22 Surfaces of Constant Gaussian Curvature 


In polar geodesic coordinates (r, g) the first quadratic form is dr? + G(r, y)dy’, 
and the Gaussian curvature K is expressed in terms of G as 


1 &PJ/G 
VG ar? 
(see formula (3.15) on p. 102). This expression can be written in the form 


0°./G 
or 2 


+KVG =0. (3.22) 


If the Gaussian curvature K is constant, then we obtain a second-order linear 
differential equation with constant coefficients. 


Theorem 3.32 (Minding) Any two regular surfaces with the same constant 
Gaussian curvature are locally isometric. Any point of one of the surfaces and any 
orthonormal basis at this point can be mapped to any point of the other surface and 
any orthonormal basis at this point by a local isometry. 


Proof First, we solve Eq. (3.22), considering three cases: (1) K = 0, (2) K > 0, 


and (3) K < 0. Recall that lim,;_.9 G = 0 and lim,_.9 Bye = | (this was proved on 
p. 102). 


(1) If K = 0, then “YE rug = 0. Therefore, 2Y@ = f(g) is a function of y. The 


function 2 ave ee not depend of 7, and tise afG = 1; hence VG=r + 
g(g). The fanetion g(¢) does not depend of r either; hence g(g) = lim;_.9 G = 
0. Thus, in case (1), we have E = 1, F =0,andG =r”. 

(2) If K > 0, then the general solution of Eq. (3.22) has the form 


VG = A(¢) cos(V Kr) + Bg) sin(V/ Kr), 


where A(gy) and B(g) are some functions. Since lim-_.9G = 0, we have 
A(g) = O. Therefore, 


ke = B(g)VK cos(V Kr). 


From lim,_,9 = = | it follows that B(g) = 


= 0, and G = x sin 2¢./Kr). 
(3) : K <0, then the general solution of Eq. (3.22) has the form 


JG = A(v) cosh(/—Kr) + B(g) sinh(/—Kr). 


1 : _ 
Wire Thus, in case (2), E = 1, 
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From the same considerations as in the previous case we obtain E = 1, F = 0, 
and G = —& sinh? (./—Kr). 

To construct the required local isomorphism, we identify neighborhoods of the 
given points with neighborhoods of tangent spaces by means of the map exp™!. 
After that, we apply the isometry of the tangent spaces which maps one orthonormal 
basis to the other. o 


HISTORICAL COMMENT Minding proved the theorem on the local isometry of 
surfaces of constant Gaussian curvature in 1839. He also found equations for 
surfaces of revolution of constant curvature. In 1840 Minding derived trigonometric 
relations in a triangle formed by geodesics on a surface of constant curvature. He 
discovered that trigonometry on a surface of constant negative curvature is obtained 
from that on the sphere by replacing the usual trigonometric functions by the 
hyperbolic ones. But Minding overlooked the connection between this discovery 
and hyperbolic geometry. The first to notice this connection was Eugenio Beltrami 
(1835-1900); this had enabled him to construct the first model of hyperbolic 
geometry in 1868. 


Now let us look at the structure of a surface of revolution of constant curvature. 
Recall that, according to Problem 3.11, if the parameterization of a meridian of a 
surface of revolution is natural, i.e., the parameterization of the surface has the form 
(f(s) cosv, f(s) sinv, g(s)), where (f’)? + (g’)? = 1, then the Gaussian curvature 
equals —£. 

First, consider the case of positive Gaussian curvature. In this case, we have K = 
a’, where a > 0, and the function f is determined by the equation f” + a7 f = 0. 
The solutions of this equation have the form f(s) = a, cos(as) + a2 sin(as) = 
Acos(as + b). Appropriately choosing the point from which the length of an arc is 
measured, we can assume that b = 0. We can also assume that A > 0. Since the 
function f must not vanish, it follows that |s| < 3. Moreover, g =tV/1-(f), 


whence we obtain 
Ss 
g(s) = at y 1 — a2A? sin? (at)dt. 
0 
1 


If A > 1/a, then the function g(s) is defined only when sin |as| < 7. 

For A = 1/a, the surface is a sphere. Curves and axes of rotation for A < 1/a 
and A > |/a are shown in Fig. 3.3. In both cases, the arc of the curve is open, 1.e., 
it does not contain its endpoints. In the second case, the limit position of the tangent 
as an endpoint is approached is perpendicular to the axis of rotation. 

In the case of zero Gaussian curvature, the equation for the function f has the 
form f” = 0, and the solutions of this equation have the form f(s) = as + b. 
Moreover, g’ = +,/1 — (f’)? = +V1—a?, and hence g = cs + d, where c = 
+1 —a?. For a = 0, we obtain a right circular cylinder, for c = 0, we obtain a 
plane, and for ac ¥ 0, we obtain a right circular cone. 
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A<+ A>+ 


Fig. 3.3. A surface of rotation of positive Gaussian curvature 


Fig. 3.4 Surfaces of revolution of negative Gaussian curvature 


Finally, consider the case of negative Gaussian curvature. In this case, K = —a’, 
where a > 0, the equation for f has the form f” — a? f = 0, and the solutions of 
this equation have the form f(s) = a; cosh(as) + a2 sinh(as). Depending on how 
a, and az are related, the solution can be written in one of the three forms (with real 
coefficients) A cosh(as + b) (if ay > az), Be® (if ay = az), and C sinh(as + c) (if 
a, < az). In these cases, respectively, g(s) = ceap 1 — a2A? sinh? (at)dt + D, 
g(s) =+ fy V1 — a? B7e2"dt+ D, and g(s) = + fy V1 — a?C? cosh? (at)dt+D. 
The corresponding curves and axes of rotation are shown in Fig. 3.4. 
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3.23 Rigidity (Unbendability) of the Sphere 


Locally, the sphere is not rigid. Indeed, in the previous section we gave examples of 
surfaces of constant positive Gaussian curvature. They are locally isometric to the 
sphere but cannot be obtained from the sphere by a motion of space. Nevertheless, 
globally, the sphere is rigid. 


Theorem 3.33 Let S be a closed connected surface of constant Gaussian curvature 
K embedded in R?. Then S is the sphere Se 


Proof A closed surface embedded in R? is orientable. If we choose its orientation, 
then its principal curvatures will be defined (when no orientation is chosen, its 
principal curvatures are defined only up to sign). We assume that the principal 
curvatures k, and k» of the surface are chosen so that kj > kz at each point of 
S. These functions are continuous everywhere on the surface S and differentiable 
everywhere except possibly at umbilical points. 


Lemma Suppose that the Gaussian curvature of an oriented smooth surface is 
positive at a point p and that this point is a point of local maximum of the function 
ky and a point of local minimum of the function kz. Then the point p is umbilical. 


Proof Suppose that the point p is not umbilical. Then in a neighborhood of p we 
can introduce coordinates (u, v) so that the coordinate lines are lines of curvature 
(see p. 107). Then the first and the second quadratic form are E du? + Gdv* and 
kiE du2 + koGd v’, and the Codazzi—Mainardi equations are written as 


Ey Gu 
(Ey = i +h), a@u = ki +b), 
1.€., 
E G 
E(ki)y = smh +k), Glko)u = om — ko). (3.23) 


According to Theorem 3.3 (see p. 76), in the case where F = 0, the Gaussian 
curvature K is expressed in terms of the coefficients of the first quadratic form as 


—~$Guu — 5 Evy 5Eu —5Ey 0 SE, 5Gu 
K(EG) = —4Gy BE 0 |=|28) 2 °O- | 
1G, 0 G 5G, 0 G 
therefore, 
— 2K EG = Ey t+ Guy + PiEy + Q1Gu, (3.24) 


where P; and Q, are continuous functions of u and v. 
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Let us express EF’, and G, from Eq. (3.23), differentiate the obtained expressions 
with respect to v and u, and substitute the resulting expressions for Ey, and Gy, 
into Eq. (3.24): 


2E 2G 
—2K EG = ———~ (ki) py + —— (ka)uu + Pa(ki)y + Q2(ko)u, 
ky _ ko ky _ ko 


—(ky — ky)KEG = —2E (ky) yy a 2G (k2) uu + P3(k1)y Se O3(k2)u- 


At the point p the quantity — (ki —k2) K EG is negative. Since p is a point of local 
maximum for the function k; and a point of local minimum for k2, it follows that 
(ki)y = (ka)u = 90, (ki)uv < 0, and (k2)y, > 0. Thus, at the point p the quantity 
—2E(ki)uy + 2G(ka)un + P3(ki)y + Q3(k2)y is nonnegative. This contradiction 
shows that p is a umbilical point. oO 


Any closed surface embedded in R? has at least one point of positive Gaussian 
curvature (this follows from Problem 3.12 on p. 74). Choose such a point on the 
surface S. Since the Gaussian curvature of S is constant, we have K > O. The 
continuous function k; on S attains its maximum at some point p. The product 
k,ko = K is a positive constant; therefore, the function kz attains its minimum at p. 
Thus, according to the lemma, the point p is umbilical, i.e., kj = k2. 

Now consider any point g of the surface S$. By assumption ki(q) > k2(q), 
whence 


ki(p) > ki(q) & ko(q) S ko(p) = ki (p). 


Therefore, ki(q) = k2(q), i.e., all points of S are umbilical. Moreover, the surface 
S is closed (by assumption), and according to Problem 3.19, this surface is a sphere. 
| 


Corollary The sphere S* is rigid in the following sense: if there exists an isometry 
of S* to a surface S embedded in R?, then S is the sphere S. 


Proof Let g: S* —> S be an isometry of the sphere to a surface S. Then S 
has constant curvature, because Gaussian curvature is invariant with respect to 
isometries. The surface S is compact and connected as a continuous image of a 
connected compact space. Therefore, according to Theorem 3.33, Sis asphere. O 


HISTORICAL COMMENT Theorem 3.33 was proved by Heinrich Liebmann (1874— 
1939) in 1899. It is known as Liebmann’s theorem on the rigidity of the sphere. 
The lemma on which the proof given above is based was proposed by David Hilbert 
(1862-1943) in 1901. It is called Hilbert’s lemma. 
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3.24 Convex Surfaces: Hadamard’s Theorem 


A convex surface is the boundary of a bounded convex body in R?. 

The Gaussian curvature of a convex surface is nonnegative at each point. Indeed, 
a convex surface lies on one side of the tangent plane at each point, and therefore all 
normal curvatures are of the same sign (although some of them may vanish). 


Theorem 3.34 (Hadamard) /f the Gaussian curvature of a connected closed 
surface S embedded in R? is positive at each point, then this surface is convex. 


Proof Let us choose the inward normal vector at each point of S. Then all normal 
curvatures are positive at each point of the surface. 

Consider the open set D bounded by the surface S. Choose an arbitrary point A 
in D and let Dg denote the set consisting of all those points B € D for which the 
segment AB is contained in D. Obviously, the set D4 is open both in R? and in the 
topological space D. Let us show that Dy is closed in D. 

Let B, be a sequence of points of D4 converging to a point By € D. Suppose 
that Bp ¢ Da. This means that the surface S$ is tangent to the segment AB at some 
interior point P of this segment. Then the segment AB lies in the tangent plane to 
S at the point P, and the normal curvature corresponding to the direction of AB is 
nonpositive, because in a small neighborhood of P the normal vector and the section 
lie on different sides of the tangent plane. (The normal vector and the points B,, lie 
on one side of this plane and the section lies on the other side.) 

Since D is connected, it follows that D4 coincides with D. Thus, the segment 
joining any two given points of D is entirely contained in D, which means that the 
set D is convex. oO 


Remark Hadamard’s theorem is valid not only for an embedded surface but also for 
an immersed one; thus, if the Gaussian curvature of an immersed closed surface is 
positive, then this surface is embedded. The proof of this theorem for hypersurfaces 
(Theorem 4.3) is given on p. 149. 


HISTORICAL COMMENT Jacques Salomon Hadamard (1865-1963) proved the 
theorem about the convexity of a surface of positive Gaussian curvature in 1897. 
3.25 The Laplace—Beltrami Operator 


Recall that the Laplace operator in the plane with coordinates (x, y) assigns the 
function 
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to each function f(x, y). The Laplace—Beltrami operator is a generalization to 
curved surfaces of the Laplace operator on the plane. The definition of the Laplace— 
Beltrami operator uses the metric of the surface in one way or another. 

For the Laplace—Beltrami operator, the same notation A is used. However, by a 
well-established tradition, the Laplace—Beltrami operator A in the case of the plane 
(and, in general, of any Euclidean space) equals the negative of the classical Laplace 
operator rather than this operator itself. Such a notation is chosen to ensure the 
nonnegativity of the spectrum of the Laplace—Beltrami operator. 

The simplest definition of the Laplace—Beltrami operator is in terms of the 
differential forms w! and w* dual to a moving orthonormal frame. Let df = 
ajo! + anw*. We set xdf = —ayo! + aw. Then d x df = d(—anw! + aa") = 
(-Af)o! A”, i.e., Af is defined as the coefficient of the 2-form w! A w* with the 
minus sign. 

In the case of the plane, this eeptOe leads es oe usual Laplace operator (with 
the minus sign). Indeed, we have df = Y dx +% ae dy and *df = —Hax + Y dy 
Therefore, 


C) 7) 
d«xdf=d Be pecan Sy, A dx ree ae, 
oy ox ax? ax? 
Ear 
-(7 +35 5) dx Ady. 


Another definition of the Laplace—Beltrami operator is in terms of the composi- 
tion of divergence and gradient: Af = — div(grad f). The gradient of a function f 
is the vector field grad( f) characterized by the property 


(grad f, V) = ov f 


for any vector field V. Here dy f is the derivative of f in the direction of the vector 
field V. 

The definition of the divergence of a vector field involves the differentiation of 
a vector field (1.e., covariant differentiation), rather than that of a function. The 
divergence div(V) of a vector field V is defined at each point p as the trace of 
the map W +> VwV; here the vector W ranges over the tangent space at p. 

The Laplace—Beltrami operator can be expressed in local coordinates in terms of 
the Riemannian metric g;;. We will do this step by step: first, we will express the 
gradient, and then, the divergence. 

Consider local coordinates u’ and the coordinate vector fields i= 4. We have 


V = V'r; and 8ij = (i, rj); therefore, 


au! du! 


a a) d d 
vf=Vi= L = Vigne S = Vien mg oe = (v. fen) : 
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Thus, 


0 
1OF 


rad f = g* 
grad f faa 


To calculate the divergence, we need expression (3.7) on p. 93: 


/ [OV 
vev = wi (vin A) re 


From this expression we obtain 
div = og vip 2 OM iri 
1V = a + ij — Bat + ij- 


Now we apply Theorem 3.7 (see p. 86) and express the Christoffel symbols in terms 
of the Riemannian metric: 


ri A "(3s d8ij wh) = 5 i1 9811 


aul aul Bui 


a) 2° Gul” 


Thus, 


avi 1 agu avi I | ag 
div V = —— ne a = Vitr na 
aul 3 © aw aul 3 Bul 


where g is the matrix of the first quadratic form. To perform further transformations, 
we need the matrix algebra formula £ In(det A) = tr (Aq! a) (see Appendix, 
Theorem 8.2 on p. 258). Using it, we obtain 


avi 1 
med is oe ye 
divV = a +- 5V 7 (In det g) = asa (V/./det g). 


As aresult, we arrive at the following expression for the Laplace—Beltrami operator 
in local coordinates: 


Af = —div(gradf) = 


Se ~. ( vaste gg! a) . (3.25) 


Let us check that the two approaches to the definition of the Laplace—Beltrami 
operator are equivalent. To do this, it suffices to prove that, for any function g 
defined on the given surface and vanishing outside a given small neighborhood U 
of the point p, both definitions of Af give the same integral ie gAf do, where 
do is the area form. We will use two systems of local coordinates, arbitrary local 
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coordinates u!, u? and local coordinates x!, x* orthonormal at the point p. For the 
metric gij in the coordinates u', we obtain 


( ) or or dr ax* ar ax! Oe dx 

pS ial) = |) oS SS Se SS eS 

Bij peed du’ Ous ax* du!’ ax! dul du! dus 

in the last expression the summation over k is performed, although both k’s are 
superscripts. 


Let us calculate the integral f pAf do = f[ pAf /gdu' A du’ for Af defined 
in the first way. By definition, we have Af do = —d(*df), whence we obtain 


[vatde = = f edcsdf) = fava ceapy: (3.26) 


we have used the relations f,, pd(#df) + Jy dy A (*df) = fy d(y(rdf)) = 
Soy GCrdf) = 0. Next, 


a a @ 

[avrcary =f (Peart + Par?) a (-2oax! + hax ’) 
7 a9 df ay af 
-([(S5+34 ae: 

/ dui aui\ af ay 
= — — }) —-——-do 
axk axk } dus dui 
= fe’ (0 OO Feat x aie 


aul aui 


1 @ | Of 
= -| eee ot ( veetee’ =) gVdet gdu! A du. 


The last equality follows from the equation 
1 0 ( jaeted , Of JE 1 0 ( aS ; Of iat) ty ; 9 Of og 
J/det g du! "88 aul® Jdetg dui gs "Ou ry aa oul 


Thus, the first approach leads to the expression (3.25) in local coordinates for the 
Laplace—Beltrami operator. 


HISTORICAL COMMENT The Laplace—Beltrami operator for surfaces was intro- 
duced by Beltrami in 1864-1865. 


Theorem 3.35 [fa function f ona closed orientable surface is such that Af = Xf 
for some constant i, then d > 0. 
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Proof Let us apply formula (3.26) to g = f: 


2 2 
| farao - farsoan= | ((F) + (5) )>o 


Here the expression in local coordinates is written at one point. The integral is 
nonnegative because the expression has this form at each point. 
Clearly, the sign of the integral 


[ fara =2f Pao 


coincides with that of A. Therefore, A > 0. oO 


Theorem 3.36 (Bochner) /f a function f on a closed orientable surface is such 
that Af > 0 at all points, then this function is constant. 


Proof Applying formula (3.26) to g = 1, we obtain 


[arao= facren wap) =o. 


Therefore, Af = 0. Now let us apply the same formula to g = f: 


=f raran= farnoan= | (4) +(F)), 


Thus, df = 0 and f = const. oO 


HISTORICAL COMMENT Salomon Bochner (1899-1982) proved in 1946 that if a 
function g on a closed Riemannian manifold satisfies the condition Ag > 0, then 
this function is constant. This assertion is often called Bochner’s lemma. Bochner’s 
lemma is closely related to a general theorem about elliptic operators proved by 
Eberhard Hopf (1902-1983) in 1927. 


A function f for which Af = 0 said to be harmonic. 


Problem 3.36 Prove that the projection of a minimal surface onto any straight line 
is a harmonic function. 
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3.26 Solutions of Problems 


1 
3.1 Answer: € °) and (; [ . The Cartesian coordinates (x, y) are expressed 
r By) 


ri 
in terms of the polar ones as x = r cosy, y = r sing. Therefore, e, = (%. 3) — 


(cos, sing) and eg = (&. 32) = (-rsing,rcosq). Calculating the pairwise 


inner products of these vectors, we find the matrix (g;;). 


1\2 —— ; 0) 

3.2 Answer: I+(f) 2 and [{ +") , |. Here ev=(f’ cos v, f’ sin v, 1) 
0 f 0 72 

and e, = (—f sinv, f cos v, 0). , 


3.3 Let y(s) and y(s) be two curves on S with common tangent at the point y (0) = 
y (0), where s is the natural parameter. Then, for small s, the distance d(s) between 
y(s) and y(s) is of order s; to be more precise, the ratio d(s)/ s? tends to a finite 
limit as s — 0. Therefore, if €3(s) and €3(s) are the unit normals to the surface 
S at the points y(s) and y(s), then &% (0) = é3 (0). By assumption €;(0) = & (0); 
it is also clear that ¢3(0) = &3(0). Therefore, ¢2(0) = €&2(0), and the equation 
£3 = —kné1 — %g€2 implies k, (0) = k,(O) and %g(O) = %g (0). 


3.4 The vectors e2, e3, €2, and &€3 lie in the same plane (orthogonal to the vector 
é, = &1), and the bases e2, e3 and €2, €3 have the same orientation (see Fig. 3.5). 
Therefore, e2 = €2 sin@ + €3cos@, &€2 = e2 sind — e3cos@, and €3 = e2cosd + 
e3 Sind. Hence e/ = e; = kez = (ksin@)e2 + (kcos@)é3. On the other hand, 
€ = kgé2 + kye3. Thus, kg =k sin@ andk, = kcos@. 


Fig. 3.5 The two bases €3 A 


E3 


E2 
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Let us calculate x,. We have Ey = —kg&1 + #g&3, so that x, = (¢3, £5). The 
Frenet—Serret formulas give 


dg dg 
& = cose — (ksin@)e; + (% sin €)e3 + sin 0-3 + (% cos A)e2. 
KY KY 


Therefore, (€3, €5) = + a. 


3.5 According to Problem 3.4, kg = k sin@ and k, = kcos@, whence i + ke = 
k?(sin? 6 + cos? 0) = k?. 


3.6 


(a) Let k,(@) be the normal curvature of the curve in the intersection of the surface 
and the plane making an angle of @ with the normal plane. Then k,(0) = 
k(@) cos @ in view of Problem 3.4. But all curves in the family have common 
tangent vector; therefore, according to Problem 3.3, all of them have the same 
normal curvature k,(@), which equals the curvature k of the normal section. 
Let R(@) = 1/k(@) be the curvature radius of the curve in the intersection of 
the surface and the plane making an angle of 0 with the normal plane, and let 
R = R(O) = 1/k. Since k = k(@) cos@, we have R(@) = Rceos@. Therefore, 
the intersection of the family of osculating circles with the plane perpendicular 
to the surface and to the given tangent vector is a circle (see Fig. 3.6). This 
means that the osculating circles fill a sphere. 


(b 


wm 


3.7 Let y(s) be a curve with natural parameterization lying on a sphere of radius 
R centered at the origin. Then ¢) = y’, €3 = +4y, and €2 = &€3 X €1. Twice 


differentiating the identity (vy, y) = R* = const, we obtain0 = Ly, y= 
2(v', v') + 2(y", yv). Therefore, 


1 1 1 
k, = (s' = ae =pt—(y' vy) = 7-. 
n (€4, £3) (v ; r) RY y) FR 


Fig. 3.6 Spheric sections 


Roos @ 0 R 
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Next, 
dey il d fies que ! iy! ” 
ae rk Bas eae xy pee 
1 1 Ke 
eee id x Kye. — Re Fe pe x €2 = kge3 X €2 = —kgé}. 
Hence it follows from £5 = —kgé, + xge€3 that x, = 0. 


3.8 Answer: +~ eee . The identity k? +k; = k? is to be used. In the case under 


consideration, kK = 1/r? and ke = 1/R?. Therefore, ke = Le 


3.9 For a straight line, the velocity vector ¢; is constant, and hence e = 0. The 
zero vector is orthogonal to any plane. 


3.10 


(a) As a meridian we can take the initial plane curve. If ¢; is the unit tangent vector 
to this curve, then the vector ¢{ is orthogonal to the curve. But then it is also 
orthogonal to the plane tangent to the surface, because the tangent plane passes 
through the tangent to the curve and is orthogonal to the plane of the curve. 

(b) If € is the unit tangent vector to the circle, then the vector ej is directed from 
the point of tangency to the center of the circle (or from the center of the circle). 
It is orthogonal to the tangent plane if and only if the tangent to the meridian is 
parallel to the axis of rotation. 


3.11 Clearly, r, = (f’cosv, f’sinv,g’), mm = (—fsinv, f cosv, 0), 
uu = (f" cosy, f" sinv, g”), ruv = rou = (—f'sinv, f'cosv, 0), and roy = 
—fcosv,—fsinv,0). Moreover, n = ——L—(~—¢' cos v, —¢’ sinv, f’). 
(Sf f ) ey & § : i) 
Therefore, E = (f’)? + (g’), F = 0, and G = f?; wealso have L = af eae wt 
F) e ) f fee 
M =0,and N = os oe Hence the determinant of the first quadratic form 
Vf) +(8')7 se 

equals f*((f’)* + (g’)*), and the determinant of the second quadratic form equals 
fe'(g" f'~f"8') _ si(e" f'-f"'s') 

renee ° Thus, K = FC mregnas 

Now suppose that the parameterization of the meridian is natural, i.e., (f’)? + 
(g’)* = 1. Differentiating, we obtain f’ f” + g’g” = 0. Therefore, 


_ gol Ft a. f(g’)? a ay Sd @ i veal f(g’? _ aoe, 
f f Ey 


3.12 Take a sphere of radius R enclosing the given surface, consider the family 
of spheres with the same center also enclosing this surface, and choose a sphere of 
minimum radius among them. This sphere and the surface have common tangent 
plane at some point p. At this point p we choose the normal vector directed inside 
the sphere for both the sphere and the surface. Then all normal sections of the sphere 


K 
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and the surface have positive curvature at p. Therefore, according to Problem 1.5, 
all normal curvatures of the surface at the point p are at least 1/R, and the Gaussian 
curvature is at least 1/R?. 


3.13 


(a) Let (x!, x”, x3) be coordinates in space R?. Clearly, SF (p) = 2(p' _ q'). 
Therefore, the derivative of F in the direction of € equals 2(p — g, €). We have 
2(p — q,&) = 0 for all vectors € tangent to S at the point p if and only if the 
vector Pa is orthogonal to S at p. 

(b) It is sufficient to consider the case where the surface S is given as the graph of 
a function z = f(x, y) with 3£(0, 0) = 0 and 5£(0, 0) = 0 and f (0,0) =0.In 
this case, the point p = (0, 0, 0) is critical if and only if g = (0, 0, r). Indeed, 
for a critical point q, the equation (q, £) = 0 must hold for all € = (é!, €7, 0). 


Clearly, F(x, y) = x7 + y* + (f(x, y) — 1). A simple calculation shows that, 
at the point p = (0, 0, 0), 


ea ra =2 (' _ tfex they ) 

Fry Fyy thy 1—tfyy 

Thus, the critical point (0, 0, 0) is degenerate if and only if det(A — tB) = 0, where 
A = I is the matrix of the first quadratic form and B is the matrix of the second 


quadratic form. The equation det(A — tB) = 0 is equivalent to the conditions t 4 0 
and det(B — t~'!A) = 0, ie., t = 1/kj. 


3.14 It can be assumed that the surface is given in the same form as in Example 3.4 
and the line / is determined by the equations y = 0 and z = 0. Then f(x, 0) = 0 
and f;,(0,0) = 0; therefore, at the point (0,0, 0) the Hessian matrix has the 
0 : ; ‘ 
form ( fry ) Its determinant equals —(fry)? < 0. The sign of the Hessian 
xy Jyy 
determinant coincides with that of the Gaussian curvature (moreover, in the situation 
under consideration, the matrix of the first quadratic form is the identity, and 
therefore the Hessian determinant equals the Gaussian curvature). 


3.15 The equation da} = ok A a yields 
3 
do; =o A a + 01 Ao; +03 Aw; =a} Aw}. 
The proof of the remaining two relations is similar. 
3.16 The equation dw! = w! A wo} yields 
dw =a! Aa, +0" Aa, +a Ax =o! Aw + 0° A a3. 


We have dw’ = 0 for the restrictions of forms to surfaces, which implies the 
required relation. 
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3.17 If the matrix of the first quadratic form is the identity, then oF = —Lo'!—Mo? 
and o3 = —Mo! — No. Therefore, 


-a! Aw we A oF = —w! A (—Ma! — No*) + @ A (—Lo! — Mo”) 


=(L+N)o! Aw =2Ho! Aa”. 


3.18 The geodesic curvature of any geodesic is zero; therefore, for a digon G, the 
Gauss—Bonnet formula takes the form 


[ kao =20 (a — a) — (4 — a2). 
G 


Moreover, —m < a — ag < a, and hence 27 — (x — a) — (m — a2) > O. Thus, 
the Gaussian curvature must take a positive value at some point of the domain G. 


3.19 First, we prove that the principal curvatures are constant on the whole surface. 
For this purpose, we differentiate the equation n; + kr; = 0 with respect to u? and 
the equation nz + kr2 = 0 with respect to u!. As a result, we obtain 742 + kar) + 
krj2 = O and nq, + ktr2 + kroy = 0. But nyg = nz, and rj2 = 121, whence 
kor; = kirz. Since the vectors r; and r2 are linearly independent, it follows that 
k, = kp = 0,1.e., both partial derivatives of the function k vanish. 

The constancy of k implies the vanishing of the derivative of the vector n + kr in 
any direction (on the surface) at each point of the surface; therefore, n + kr = a is 
a constant vector. Since r = “*, it follows that the given surface is the sphere of 
radius 1/k centered at a/k, because the vector n has length 1. 


3.20 The eigenvalues of the operator L are the principal curvatures k; and k2. This 
follows from Rodrigues’ formulas and the geometric definition of L; although, this 
is also seen directly from the definition L = BA7—!, where A and B are the matrices 
of the first and second quadratic forms. Indeed, it suffices to note that det(B —dA) = 
det(L — d/) - det A. 

Any operator satisfies its own characteristic equation; therefore, L?- (ky ko) L+ 
kikoI = 0,ie., L? -2HL+KI=0. 


3.21 First, we calculate I';;, = (rij, r). In the case under consideration, we have 


r) = (cosg,sing), r2 = (—rsing,rcosg), ri = (0,0), riz = ra = 
(— sing, cos@), and r22 = (—rcosg,—rsing). Hence Ty22 = T212 = r and 
I'921 = —r; the remaining symbols I’;;x are zero. Next, we have vy = a" Tia, 
where g!! = 1/r?, g?* = 1, and g!* = g*! = 0 (see Problem 3.1). Therefore, 
a — i, = 1/r and re = —r; the remaining symbols ri, are zero. 


3.22 First, we calculate Pj jx = (rij, rg). In our case, r; = (f’ cos v, f’ sin v, 1), 
r2 = (-fsinv, fcosv,0), ri = (f”"cosv, f”sinv,0), rig = roa = 
(—f’sinv, f’ cosv, 0), and r22 = (—f cosv, —f sinv, 0). Hence [y1; = f’ f”, 
Py22 = F212 = ff’, and T221 = — ff’; the remaining symbols I; ;z are zero. Next, 
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we have g!! = nH ge = aa and g!* = g?! — 0 (see Problem 3.2). Therefore, 

tc Oe. ae 2 .. 1 ic, HP ‘a k 
ry,= THe? My, = 13, = cae and P5, = — rea the remaining symbols CS 
are zero. 


2 = »P 
3.23 First, note that r; = rp a Therefore, 


_ OF» dy? _ dy? 
Vij =>: Aon YD otAaT. 
ox/ Ox Ox'dOxJ 
_ dy? dyP _  a2yP 
~ TPE ai Oxi p ax! dx 


dy! dy? ays 
=(F pats + Ppgn ae Ox! me Ox! dxs- 


On the other hand, rj; = =e Vk + bn and rg = 7,24 aa . The vectors 7; and rz are 
linearly independent and orthogonal to the vector n, which implies 


re Ou. _ ps Oy? ayt a°y* | 
Y axk Pq Oxt Oxt ax'dx/’ 


whence 


ae = _ dy? _ayt ays ax* 
ul PI 4x) Axi | Axtdxi ) dys 


3.24 Let T ‘ and c @ be the Christoffel symbols in local coordinates (x!, x”) and 
- a) If yk = = xk 4 chextxd, where the cf are constants, then 0) — 5 and 


a - a—taa7 (0) = Cj io Cj ;- Therefore, according to Problem 3.23, we have 


PE (0) = (4,057.54 + cf; +.c4,) 58 
= ri) + cf + Ce: 


Now, setting c= —5Ti (0), we obtain ri, (0) = 0, because DS -_ ie 
Another method for constructing a coordinate system for which all Christoffel 
symbols vanish at a given point is described in the solution of Problem 3.28. 


3.25 Choose a parameterization r(u!, u~) of the manifold M in a neighborhood 
of some point so that the vectors ry = or. and ry = 2 form an orthonormal 
basis at this point. Let us show that df (ar, + br2) = an, + bn2, where ny; and n2 
are the derivatives with respect to u! and u2 of the normal vector n(u!, u2) to the 
surface. Indeed, consider a curve r(u!(t), u?(t)) passing through the given point at 


t = 0. The velocity vector of this curve at this point equals (u!)'r, + (u2)'ro. The 
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Fig. 3.7 The net of the cone 


2y 


differential df maps this vector to the vector (u!)’ ny + (u*)'n2, as required. Thus, 
the entries of the matrix of the operator df in the basis r, rz are (nj, rj) = —Dij. 


3.26 Suppose that the radius of the circle in the intersection of the plane with the 
sphere is r sing. Then the length of this circle equals 27r sing and S = 2mr?7(1 — 
cos yg). Consider the cone tangent to the sphere along the given circle (which also 
serves as the base of the cone). Its slant height equals r tang. Consider the net of 
this cone (see Fig. 3.7). 

The required angle a between the vectors V; and V> is equal to the angle AOB. 
Indeed, the parallel transport from A to B along the arc AB takes the vector V; toa 
parallel vector V2. The vector at B corresponding to the vector V) is the vector BB, 
for which ZO BB, = ZOAA\; hence, at the point B, the angle between the vectors 
VY, = BB, and V> equals the angle AOB. 

The length of the arc AB is equal to that of the circle, i.e., 277 sin g. Therefore, 


2mr sing 
2m — a = ——— = 27 cos, 
r tang 


whence a = 27(1 — cos@) = Sir. 
Another solution of this problem is based on the corollary of Theorem 3.12. 


2 
3.27 Clearly, ,,(f) = 24 and 4,,,;(f) = =44,. Thus, the relation [r), r2] = 0 
follows from the equation asm = we. 
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> = tc* at a point p: 


3.28 Let us write an equation for a geodesic x! = tc!, x 
d?x! +r dxi dx* 7 
dp’ de dt 

a . 2,5 . ‘ 

In the case under consideration, ax = c' and cs = 0. Therefore, I" “ik cick = 0 for 

any numbers c’. Thus, My = 0. 

Another way of constructing a coordinate system for which all Christoffel 

symbols at a given point vanish is described in the solution of Problem 3.24. 


3.29 The parallels and meridians are orthogonal; therefore, it suffices to prove that 
the parallels are lines of curvature. Moving along a parallel, the unit normal vector n 
rotates about the axis of rotation of the surface; hence its derivative n’ is proportional 
to a velocity vector in cicular motion. 


3.30 Let n;(t) and n2(t) be the unit normal vectors to the surfaces at the points of 
the curve y(t). If y is a line of curvature for the first surface, then n) = A, y’. If 
the vectors n; and n2 are proportional (i.e., the surfaces are tangent along the curve 
y), then n, = + y’, and therefore y is a line of curvature for the second surface 
too. In what follows, we will assume that the vectors n; and n2 are not proportional. 
Since these vectors are perpendicular to y’, we have 


ny = Agy’ + pny + vn. 


It is required to prove that 44 = v = O. For the inner products of the expression for 
n’, with n; and nz, we have 


(n1,n) = e+ v(M1,Nn2),  (N2,n5) = W(n2, 1) + v. 


Clearly, (n2, ny) = 0 and (ny ’ n}) = (n1 ’ nz)’ a (n', n2) = —(n; ’ n2) = 
—(A,y’, n2) = 0. Hence 


m+ v(n1,n2)=0, w(n2,n1)+v=0. 


The inequality (71, nz)? < limplies wu =v =0. 


3.31 For an asymptotic line, the first equation of motion for the Darboux frame is 
E} = kge; therefore, 2 = -te2, where e2 is the second vector from the Frenet— 
Serret frame. Reversing, if necessary, the direction of the normal vector to the 
surface, we can achieve the equality e2 = e2. 


3.32 According to Euler’s formula, asymptotic directions correspond to those 
angles g for which k; cos? g+ko sin? yg = 0, ie., tan? g= -2. Moreover, the 
angle between asymptotic directions equals 2y¢. Thus, asymptotic directions are 
perpendicular if and only if kj = —k2,i.e., H = 0. 
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Fig. 3.8 A surface of revolution 


3.33 Consider the surface obtained by rotating the graph of y = f(x) about the 
Ox axis. According to Problem 3.29, the parallels and meridians of a surface of 
revolution are lines of curvature. Therefore, a surface of revolution is minimal if and 
only if the normal curvatures of the parallels and meridians have the same absolute 
value and opposite signs. It the normal vector n is chosen as in Fig. 3.8, then the 


normal curvature of the meridians is cE a 72 and the normal curvature of the 


parallels is -1 cos 6 (by virtue of Meusnier’s formula). Moreover, tan@ = y’. Asa 
result, we obtain the following equation for a minimal surface of revolution: 


y" _ 1 1 
GOP? » Geo 
There exists a point x for which y’(x) #4 0. Let us find a solution in a 


neighborhood of this point. To this end, we multiply both sides of the equation by 
2y': 


2y'y A 2y’ 
1+? y 
Setting z = 1+ (y’)’, we obtain z’ = 2y’y", whence 
7! 2y’ 


Therefore, Inz = In y* + Ina = Inay? and 1 4+ (y’)? = z = ay”. Clearly, a > 0; 


let a = k?. Then 
d 2 
ee (2) = (ky, 
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Le., 


kdy 


v (ky) —1 


Thus, arcosh(ky) = kx +c, ie, y = z cosh(kx +c), so that the derivative y’ can 
vanish only when kx + c = O. For the graphs corresponding to kx + c > O and 
kx +c < 0 to be smoothly glued, it is required that k and c be the same for both 
graphs. 


=kdx. 


3.34 According to Problem 3.20, the Weingarten operator L satisfies the relation 
L* = 2HL — KI, where H is the mean curvature, K is the Gaussian curvature, and 
/ is the identity operator. Thus, for a minimal surface, we have L* = — KI. 

The differential of the spherical Gauss map equals —L. Therefore, the spherical 
Gauss map transforms the inner product (V, W) on a minimal surface into the 
inner product (-LV,—-LW) = (L2V,W) = —K(V, W). If inner products are 
proportional, then they define equal angles between vectors. 


3.35 The required variation is given by the map (s, t) EXP, (7) (5 W(t)). 


3.36 It is sufficient to prove that each of the three coordinates in R? is a harmonic 
function on a minimal surface. Let x = (x!, x2, x3) denote a point of a minimal 
surface. Consider a moving orthonormal frame €1, €2 and let ¢3 be the normal vector 
to the surface. Then dx = we, + we2 and dx x €3 = —w'en + we, = *dx. 
Therefore, 


—d(*dx) = dx x de3 = (w'e| +.w*e2) x (whe1 + 0362) = (o! A wy — wo A w4)e3. 
Moreover, we have w! A w3 —w A ow} = 2Ha! A a? according to Problem 3.17. 


Hence (Ax)w! A w* = —d(*dx) = (2He3)o! A w’, i.e., Ax = 2He3. Thus, if 
H = 0, then the functions x!, x2, and x3 are harmonic on a minimal surface. 


Chapter 4 ®) 
Hypersurfaces in R”*+!: Connections od 


This is a transitional chapter from surfaces in three-dimensional space to Rie- 
mannian manifolds of any dimension. The intermediate object is a hypersurface, 
which is a direct generalization of a surface. If M” C R”*! is a submanifold of 
codimension 1, then M” is called a hypersurface in R"*!. In Chaps. 1 and 3 we 
dealt with hypersurfaces in R? and in R*. Now we will discuss (in Sects. 4.1-4.6) 
hypersurfaces of an arbitrary dimension. (Here and in what follows, all manifolds, 
functions, vector fields, vector bundles, sections of vector bundles, and so on are 
assumed to be smooth.) 

The rest of the chapter (Sects. 4.74.10) is concerned with the general theory 
of connections on vector bundles. The notion of a connection on a vector bundle 
generalizes those of covariant differentiation on a surface and of a connection on 
the tangent bundle of a hypersurface, which is introduced in Sect. 4.2. 


4.1 The Weingarten Operator 


If a hypersurface M = M” is orientable, then at each point x € M we can define a 
unit vector V(x) normal to M so that it depends smoothly on x. (If the hypersurface 
M is nonorientable, then such a family of vectors can be defined only locally.) With 
each vector X in the tangent space TM at a point x we associate the vector L(X) 
defined as follows. Let X = X‘e;, where e},..., €n+1 is the standard basis in Rt, 
Then —L(X) = 0xN = X‘Q,N is the derivative of N in the direction of X. It is 
easy to check that L(X) € 7).M. Indeed, differentiating the equation (NV, NV) = 1, 
we obtain 0 = dx(N, N) = 2(0xN, N). Thus, L is a self-map of the space 7}. M; 
clearly, it is linear, i.e., L is a linear operator. The operator L is called the Weingarten 
operator. 
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The Weingarten operator is closely related to the spherical Gauss map f: M > 
S”, which takes every point x € M to the vector N(x) treated as a point of the unit 
sphere. Note that —L(X) = df (X), because dN(X) = X'0;N = dyN. 

Example 4.1 For the standard sphere of radius R, the Weingarten operator equals 
+141, where / is the identity operator (the minus sign corresponds to the outward 
normal WN and the plus sign, to the inward one). 


Proof For the sphere, the Gauss map f has the form f(X) = +5X . oO 


A point of a hypersurface is said to be umbilical if the Weingarten operator L 
is proportional to the identity operator at this point. For example, all points of the 
sphere S” are umbilical. 


Theorem 4.1 The Weingarten operator L: T,.M —> T,M is self-adjoint with 
respect to the inner product in R"+! restricted to T;M; i.e., (L(X), Y) = (x, L(Y)) 
forany X,Y € T,M. 


Proof Clearly, (NV, X) = 0 and (N, Y) = 0. Therefore, 


0 = dx(N, Y) = (0xN, Y) + (WN, oxY), 
0 = dy(N, X) = (oy N, X) + (N, dy X). 


Thus, 
(L(X), Y) — (X, L(Y)) = —(@xN, Y) + (@yN, X) 
= (N, 0xY — dyX) = (N, [X, Y]) = 0, 
because the vector [X, Y] lies in the tangent space 7; M. Oo 


The Weingarten operator L makes it possible to define the first, second, third, 
etc. bilinear forms on T, M by setting the value of the kth bilinear form at a pair 
of vectors (X, Y) to equal (L‘-!X, Y). All these forms are symmetric, because the 
operator L is self-adjoint. The first bilinear form is merely the inner product of 
vectors. In differential geometry these bilinear forms are not as important as the 
corresponding first, second, third, etc. quadratic forms (L‘~! X, X). 

The determinant of the operator L is called the Gaussian curvature of the surface 
at the given point and denoted by K. The trace of the operator L is called the 
mean curvature and denoted by H. The eigenvalues of the operator L are called 
the principal curvatures. 


Remark For a hypersurface in R?, the definitions of the second quadratic form and 
the Gaussian curvature coincide with the corresponding definitions in Sect. 3.4. 
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Let k; be one of the principal curvatures, and let e; = a be the corresponding 


eigenvector. For a hypersurface, Rodrigues’ formula oN = —k; or remains valid. 
Indeed, 


7 = 0N = —L(e;) = —kje; = -h a. 

On a hypersurface M” consider an infinitesimal parallelotope spanned by vectors 
a\€}, ..., Ay,en. The spherical Gauss map takes it to the infinitesimal parallelotope 
with edges —k ae], ..., —ky,dnen. The ratio of the volumes of these parallelotopes 
is equal to the absolute value of the Gaussian curvature. Therefore, the ratio of the 
volume of the image under the spherical Gauss map of an infinitesimal figure on M” 
to the volume of this figure itself equals the absolute value of the Gaussian curvature 
of the hypersurface. 


Example 4.2 Consider a hypersurface given as the at of a function, i.e., as 
ae = f(xl, x”), and suppose that the vector (0, . :9, 1) is normal to it, 


i.e., oe 7 fF (0) = = 0 for all i. Then, in the local ciondinates: x!,..., x”, the second 


quadratic form at the origin is determined by the Hessian matrix ee 7 (0). 
Proof Clearly, W(X, Y) = (L(X), = ie 0 = (N, 0x Y). The hypersur- 


face is parametrically given by r = = (", Gay x”)). Therefore, the last 
coordinate of the vector r;(0) = “4(0) equale th ~7 (0), and the last coordinate of the 


i 


Si 
vector 0;,rj equals 5 (0). 5 


Example 4.3 For the hypersurface xntl — 5(k (x Wye +---+k, G2) with normal 
vector (0,..., 0, 1), the Weingarten operator at the origin is diagonal with diagonal 
elements k;. 


Any operator L satisfies the equation p(L) = 0, where p is the characteristic 
polynomial of L, i.e., p(A) = det(Z—AJ) (as usual, J denotes the identity operator). 
For n = 2 (i.e., for a surface in R*), the characteristic polynomial has the form 
i? — (trL)A + det L and, therefore, the Weingarten operator L satisfies the relation 
K-I—2H-L+L? = O. Thus, if I, IL, and Il are the first, second, and third 
quadratic forms, then 


K -I-2H-I1+M=0. 


In particular, for n = 2, the third quadratic form is expressed in terms of the first 
two; it is also clear that the remaining quadratic forms are expressed in terms of 
the first two as well. For a hypersurface of dimension n, all quadratic forms are 
expressed in terms of the first n quadratic forms. 
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4.2 Connections on Hypersurfaces 


Given any two vector fields X and Y in R"t!. the vector field dxY, that is, the 
derivative of Y in the direction of X, is defined. If X and Y are vector fields on a 
hypersurface M Cc R"*!, then the vector field 3x Y is not necessarily tangent to M. 
Consider the vector field on M defined as the orthogonal projection of the vector 
dxY on 7;M at each point x € M. The vector field thus obtained is called the 
covariant derivative of Y in the direction of X and denoted by Vx Y. 


Theorem 4.2 (Gauss’ Formula) Covariant derivative is given by the formula 
Vx¥ = dx¥ — (L(X), Y)N. 


Proof It suffices to check that the vector dy Y — (L(x), Y )N lies in the tangent 
plane, i.e., is orthogonal to N. The inner product of the vectors 0x Y — (L (X), Y)N 
and N equals 


(dx Y, N) — (Y, L(X)) = (OxY, N) + (Y, ax N) = Ox(¥, N) = 0. 


oO 
It follows from Gauss’ formula that the map V is linear in X and Y. Moreover, it 
obeys Leibniz’ rule 


MXC Y)= FVx¥ be OP) Y. 


A map with these properties which takes any two vector fields on a hypersurface to 
a third vector field is called a connection on M. 

Here we will consider not arbitrary but special connections, called the Levi-Civita 
connections, which have the following special properties: 


e VxY —VyX = [X, Y] (symmetry); 
« dx(Y, Z) = (VxY, Z) + (Y, Vx Z) (compatibility with the metric). 


Both these properties are easy to derive from Gauss’ formula. Indeed, since the 
Weingarten operator is self-adjoint, we have 


VxY — Vy X = 0x Y — (L(X), YN — dy X + (L(Y), XN 
= dy¥ — dyX = [X,Y 


and since the vector N is orthogonal to any tangent to the hypersurface, we have 


ax (Y, Z) = (OxY, Z) + (Y, 0xZ) 
= (VxY, Z) + (Y, VxZ). 
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Let y(t) be a parameterized curve on a hypersurface M, and let X be a vector field 
on M (for our purposes, it is sufficient that this vector field be defined on the curve 
y). The vector field X is said to be parallel along y if Vy X = 0 for Y = ae 

The geometric meaning of this notion is as follows. For a vector field X parallel 
along a curve, the vector dyX = ao must be orthogonal to the tangent space. 
Therefore, we first perform the parallel translation of X(t) from the point y(t) to 
the point y (t+ dt) and then project the translate onto the tangent plane at this point. 
As a result, we obtain a vector X(t + dt) for infinitesimal dt. Thus, the parallel 
transport along the curve changes a vector only in the normal direction; this change 
is necessary for the vector to remain tangent to the hypersurface. 


Problem 4.1 Prove that parallel transport along a curve preserves (a) the length of 
vectors; (b) the angles between vectors. 


A curve y ona hypersurface is said to be geodesic if the vector field oy is parallel 


along y, i.e., Vay = = 0. 
er ar at 


Example 4.4. Any curve y(t) on the sphere S” lying in a plane passing through the 
center of the sphere and having natural parameterization is geodesic. 


Proof Let X be the velocity vector of such a curve y(t). By virtue of Gauss’ 
formula, we have Vy X = dxX — (L(X), X)N. The section of the sphere S” by 
a plane through its center is the circle S!. For a point uniformly moving along the 
circle S!, the acceleration vector un = d0xX is directed from this point to the 
center of the circle. The center of S! coincides with that of the sphere; therefore, 
the vector dy X is directed along the normal to the sphere. The vector (L (Xx), X )N 
is directed along the normal as well, so that the tangent vector Vx X cannot have 
nonzero tangential component, and hence it must be zero. Oo 


4.4 Convex Hypersurfaces 


A convex hypersurface is a hypersurface which is the boundary of a bounded convex 
body in R”*!. A convex hypersurface lies on one side of the tangent space at each 
point. All principal curvatures of such a surface are of the same sign (although some 
of them may vanish). This follows from Examples 4.2 and 4.3 (see p. 147). 


Theorem 4.3 (Hadamard, 1897) [fall principal curvatures of a connected closed 
hypersurface immersed in R"t! (n > 2) are positive (or negative), then this 
hypersurface is convex (in particular, it is oriented and embedded rather than only 
immersed). 
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Proof We denote the given hypersurface by h(M”); here M” is a manifold and 
h: M" —> R"*! is its immersion. Let p € M” be a point, and let Tp be the tangent 
space to the hypersurface at the point h(p), which consists of the velocity vectors 
of the images of curves passing through p. It follows from the assumptions of the 
theorem that the image of a small neighborhood of p under the map / lies on one 
side of T,,. Let us direct the normal vector N(p) to the same side (then all principal 
curvatures are positive). This defines an orientation on the hypersurface, which 
makes it possible to define, in turn, the spherical Gauss map f: M” — S". Since 
all principal curvatures are nonzero, it follows that the map df is nondegenerate 
at each point and hence f is a local homeomorphism. Any local homeomorphism 
from a connected compact manifold to a simply connected manifold is a (global) 
homeomorphism (see, e.g., [Pr2]). Thus, for different points p and q of the manifold 
M", the corresponding normal vectors N(p) and N(q) are different. 

Consider the tangent space 7;,. On the hypersurface there are points which lie 
on the same side of it as the vector N(p). Let q be the farthest of them. Then 
N(q) = —N(p). We claim that the hypersurface has no points lying on the other 
side. Indeed, suppose that such points exist and let r be the farthest of them. Then 
N(:p) = N(r), but this is not so. 

Let us show that 4 is an embedding. Suppose that h(p) = h(p’) for a point 
p’ different from p. Then p’ is different from g, and hence the vector N(p’) is 
different from N(p) and N(q). Therefore, the tangent spaces 7, and T,, (passing 
through the point h(p) = h(p’)) do not coincide. This implies that the hypersurface 
contains points on both sides of the tangent space T,,, which is false. 

The convexity of the hypersurface follows from the fact that it lies on one side of 
every tangent space. oO 


4.5 Minimal Hypersurfaces 


A minimal hypersurface is a surface with zero mean curvature. As well as in the 
case of surfaces in R?, this name reflects the fact that any normal variation (that is, 
a variation in the normal direction) of a minimal hypersurface increases its volume. 

To calculate the volume of a hypersurface (or of its part), we use the volume 
form. Let €),..., &, be a moving orthonormal positively oriented frame on a given 
hypersurface (or on some domain of this hypersurface), and let w!, ..., w” be the 
dual frame. Then the volume form on the hypersurface is @! A--- Aw". 

Let us find out how the volume form transforms under a change of basis. Consider 
another basis e; = a} é;. The matrix A = (a} ) is the transition matrix from the 
basis € to the basis e. Let ok = bia? where « is the dual basis for e. Then bk — 
bh a? (ei) = w(e:) = ajw*(e;) = af. Therefore, wo! A--- Aw” = (det A)a! A 
--» A a", The matrix g of the first quadratic form in the basis e equals AA’, so that 


det A = +./det g. If both bases are oriented in the same way, then det A = ./det g 
andw! A--- Aw" = Jfdetga! A--- Aa". 
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We are interested in small normal variations of a bounded domain of a hypersur- 
face. Therefore, we assume that M” is a bounded hypersurface with boundary given 
in the parametric form r(u!,..., u”) € R"*+!. A normal variation of this hypersur- 
face is a family of hypersurfaces M/’ of the form r;(u) = r(u) + th(u)N(u), where 
h is a function vanishing in a neighborhood of the boundary of the hypersurface and 
N(u) is the unit normal to r. Consider the function vol(t) = i mM dV;, where dV; is 


the volume form on the hypersurface M7’. Let us show that 
vol’ (0) = -{ nhH dVo. (4.1) 


We will use the fact that if g(t) is a family of matrices in which g(0) is the identity 
matrix, then 4 det g(0) = 4 tre (0) (see Appendix, Theorem 8.1 on p. 257). 

From (4.1) it follows that a hypersurface is minimal if and only if vol’(0) = 0 
for any normal variation of any of its domains. This is proved in precisely the same 
way as for surfaces (see p. 109). If a hypersurface is not minimal, then there exists 
a normal variation reducing its volume. 

Consider the map f;: Mj — M7’ taking each point r(u) to the point r;(u). 


Suppose that the vector ¢; is represented by a curve y;(s), ie., €; = eG) sf 
‘ s=' 
Then the map f; takes ¢; to the vector 
dr; (vis 
ej(t) = dri(vi(s)) =e +thN; + th;N, 
ds s=0 
where N; = aN tyts) ; and h; = AUG) | . Therefore, in the basis ¢;(t) the 
a 


matrix of the wi re form on the pe ee M? is 
gij(t) = (ei (t), €;@) = aij O)+thl (ei, Nj)+(e;, Ni)] = 63; -2th(ei, Lej)+o(t). 


Clearly, 


/ dV; = / fa; = / fi (det g(a! (t) A+-+Ao"(t)) 
= = f;(M") M" M" 
= i Jdet g(tho! A--- Aw" = 7 det g(t)dVo. 
Mn Mn 


Hence 


gore 


vol'(0) = =f, y det g(t) ae 
=5 ah yD gi, (O)dVo. 


hia ae: 
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We can assume that the vectors ¢; form an orthonormal eigenbasis of the self-adjoint 
operator L. Then 


g;; (0) = —2h(e;, Le) = —2kjh, 


and therefore 


n 


vol’ (0) = -f{ ) kihdVo = -| nHhdVo. 
Mo 
L 


4.6 Steiner’s Formula 


Consider a body bounded by an embedded oriented hypersurface M” C R"*!. Let 
V be the volume of this body, and let X be its surface area (i.e., the volume of 
the manifold M”). Consider also the set M? of those points that are obtained by 
translating a point x € M” along the normal to the hypersurface at this point by 
a distance of ¢. For sufficiently small ¢, the set M? is an embedded hypersurface; 
in what follows, we assume that ¢ is small enough. Let V(e) be the volume of 
the body bounded by this hypersurface, and let X(¢) be its surface area. Clearly, 
Vyr)=Vt fo X (€)deé (the plus sign corresponds to the outward normal and the 
minus sign corresponds to the inward one). Therefore, if we manage to calculate the 
volume of the manifold M?, then we will be able to calculate V(r) by integration 
with respect to e. 

To calculate the volume of the manifold M?, consider the spherical Gauss map!. 
The ratio of the volume of an infinitesimal figure on the hypersurface M?’ to the 
volume of the image of this figure under the spherical Gauss map equals 1/|K|, 
where K is the Gaussian curvature (see p. 147). Therefore, X (¢) = Is wor where 
dV is the volume form and K (¢) is the Gaussian curvature of the point of M? that 
is mapped to the corresponding point of the sphere. 

The quantity 1/K, where K is the Gaussian curvature, equals the product of the 
principal curvature radii. We also have Rj(¢) = Rj + €, where R; = R;(0) is the 
curvature radius of the initial hypersurface. Let us transport the integral along the 
unit sphere backward but to M5 = M” rather than to M2. As a result, we will 
represent X (e) as the integral of [] ae = [](1 + ek;) over M”. Multiplying out, 
we obtain a polynomial in ¢ whose coefficients are symmetric polynomials in the 
principal curvatures. 

For example, consider a surface M7 in R?. We have 


X(e)= i; A + (ky + ky) + 7kiko)do, 
M 


' This approach to calculating the volume of M” was communicated to me by A. G. Kulakov. 
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where do is the area form on the surface. The product k;k2 is the Gaussian curvature 
K. Therefore, by the Gauss—Bonnet theorem, Swe kikygdo = 2x (M2). For 
example, if the surface is convex, then f m2 kiko do = 41. Moreover, kj +k2 = 2H, 
where H is the mean curvature. Thus, 


X(e) = X(0)+2e | Hdo +2672 x(M’). 
M2 


Integrating this formula with respect to ¢, we obtain 
2 32m 2 
V(e) = VO) +eX(O) +e Hdo+e 3 X(M ). 
M2 


HISTORICAL COMMENT In 1840 Steiner obtained formulas for the volume and 
surface area of the body B, consisting of all points at a distance of at most ¢ from 
a convex body B in the plane or three-dimensional space. To many-dimensional 
bodies this result was generalized by Hermann Wey] (1885-1955) in 1939. 


4.7 Connections on Vector Bundles 


An attempt to calculate the derivative of a vector field (e.g., the acceleration of a 
parameterized curve) on a manifold right away runs into the difficulty involved in 
the necessity to consider differences of vectors lying in different tangent spaces. 
Such a differentiation requires a method for identifying different tangent spaces. 

A vector field on a manifold is a special case of a section of a vector bundle 
over a manifold (a vector field is a section of the tangent bundle). A connection 
on a vector bundle associates each section with its covariant derivative in the 
direction of a vector field. Covariant differentiation makes it possible to define the 
parallel transport of a fiber along a curve (see p. 156). Using parallel transport, we 
can identify different fibers, transporting them along a curve. This identification 
connects different fibers—that is where the name connection comes from. A 
connection is completely determined by this identification of fibers: using the 
parallel transport of a fiber along a curve, we can define the differentiation of a 
fiber in the direction of a vector field, and this differentiation coincides with the 
ordinary one (see p. 157). 

Let M” be a manifold. A connection V on M” is a map taking each pair of vector 
fields X and Y on M” to a vector field Vx Y on M” so that this map is linear in X 
and Y and, for any function f on M", V¢x¥ = fVx¥ and 


Vx(fY) = fVxY +(x f)Y (Leibniz’ rule). 


The vector field Vx Y is called the covariant derivative of Y in the direction of X. 
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In a similar way we can define a connection on any vector bundle € over a 
manifold M”. Namely, a connection V on a bundle & associates a vector field X 
on the manifold M” and a section s of the bundle € with a section Vx(s) of the 
bundle & so that this correspondence is linear in X and s and, for any function f on 
M”, the relation Vx(s) = fVx(s) and Leibniz’ rule 


Vx (fs) = fVx(s) + (x fs 


hold. Thus, a connection on a manifold is a connection on its tangent bundle. 


Problem 4.2 Let V! and V7 be connections on a bundle é, and let A and jz be 
constants. Prove that AV! + wV? is a connection if and only ifA+u=1. 


Example 4.5 Consider a trivial bundle with fiber R”. Each section s is determined 
by a set of n functions gs!) ..., s” for which s(x) = > s'(x)e;(x). The formula 
Vx(s) = >>(0xs')e; defines a connection. 


Proof The relation Vx (s) = fVx(s) holds because d¢x = fdyx. Since 
dx(fs') = f(dxs') + (dx f)s', it follows that Vx (fs) = fVx(s)+ (Ox f)s. OU 


At every point p € M” the section Vx(s) is completely determined by the 
restrictions of X and s to any neighborhood U of p. Indeed, take a neighborhood V 
of p whose closure is contained in U. Let us construct a function f on M” so that 
f(x) = 1 forx € V and f(x) = 0 for x ¢ U. The equality Vrx(s) = fVx(s) 
shows that Vx(s) at p is completely determined by the restriction of X to U. Since 
the function f is constant in a neighborhood of p, it follows that (dy f)(p) = 0. 
Therefore, according to Leibniz’ rule, we have Vx (fs) = Vx(s) at the point p. The 
section fs is completely determined by the restriction of s to U. 

Thus, it makes sense to speak of the restriction of a connection to an open set U C 
M" In particular, it makes sense to speak about a connection in a given coordinate 
neighborhood over which the bundle is trivial. 


Problem 4.3 Prove that on any vector bundle over a manifold M” there exists at 
least one connection. 


Consider a trivial d-dimensional vector bundle € over a neighborhood U in R”. 
Let e1,..., eg be a basis in the fiber of &, and let €1, ..., €, be a basis in R”. Then 
a section s is determined by functions a 69 ae s4(x) for which s = slej, and a 
vector field X is determined by functions X lax), ..., X"(x) for which X = X‘¢;. 
Let V be any connection on the bundle €. Consider the functions T° t (x), where 
i = 1,...,n and j,k = 1,...,d, defined as the coefficients in the expansion 
Ve,ej = ri, ex of the section V,,e; in the basis e1, ..., eg. 


The functions ri. are completely determined by the connection V. Indeed, 


i ; : ; as/ 
Vx(s) = X' Ve, (s/e;) and Ve,(s/e;) = 8! Ve,ej + agit 
x 
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The functions ri, are called the Christoffel symbols. 

We have already encountered the Christoffel symbols in the special case of a 
surface in space. Those Christoffel symbols were symmetric: rs; k =i Now we 
cannot hope for symmetry, because the indices i and j range over sets of different 
cardinalities. But even in the case where d = n, the Christoffel symbols may be 
asymmetric; for this reason, a special torsion tensor is introduced, which is designed 
just for measuring their asymmetry. 

In what follows, in studying Riemannian manifolds, we will deal with the Levi- 
Civita connection, which is constructed from the Riemannian metric. For the Levi- 
Civita connection, the Christoffel symbols are symmetric. Therefore, we do have to 
monitor the order of indices i and j in the symbols Les but only in this chapter. 
Note that there is no convention concerning the order of writing the subscripts i and 
j. The formula Ve,e; = T ‘ ex for asymmetric symbols Christoffel is also frequent 
in the literature. 


Theorem 4.4 Let i and T |, be the Christoffel symbols of a connection on the 
tangent bundle of a manifold M" with respect to local coordinates x and y. Then 
, Oy’ ax! axt dy’ a2xk 


ea 
18ST BR ByF Ay ORE By | 


Proof Let e, = sr and é, = ra be the basis vectors for the local coordinates x 


2 k 
and y. Then é; = ek. Therefore, 


We fix r, s, and ¢, multiply this equation by & ~ for k = 1,..., , and sum the 
resulting relations. Again applying the same cae we obtain the required relation. 
oO 
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Covariant differentiation in the dual bundle, i.e., the covariant differentiation of 
1-forms, is defined so that 


dx (@(Y)) = (Vx@)(Y) + o(VxY). 
Thus, the definition of the covariant derivative Vxw is as follows: 
(Vxo)(Y) = dx((Y)) — o(VxY). 


HISTORICAL COMMENT The theory of connections on vector bundles was devel- 
oped by Jean-Louis Koszul (1921-2018) in 1950. 


4.8 Geodesics 


The equation Vyr,,(s) = x Vz,(s) shows that if we are interested in Vx(s) at a 
point p, then it suffices to know the vector field X only at the point p; the behavior 
of X in a neighborhood of p is of no interest. Moreover, if X = y’ and we are 
interested in Vy(s) on some parameterized curve y: (a,b) > M", then it suffices 
to know only the restriction of s to the curve y; the behavior of the section s in a 
neighborhood of y is of no interest. Indeed, let y(t) = x! (t)e;. Then y(t) = ae 6; 
and 


dx' asi dx! 
dt ax! Fi 


dx' asi dx! kay ds! dx' kpJ 
~ (Fate r)e) ~ (te vik J 64 


Thus, to the section s we can assign the section V,/(s), which is defined on the 
curve y and depends only on the restriction of s to y. 

We say that a section s is parallel along a curve y if V,/(s) = 0 at each point of 
y. In local coordinates this condition is written as the system of linear differential 
equations 


Vi(8) = Vast, 64e;) = 


j » dx! 
“+ 1 st =0, j=l,...,dimé. (4.2) 
In an interval (a, b) this system has precisely one solution satisfying a given initial 
condition s(to), to € (a, b). Therefore, if a bundle € is equipped with a connection 
and points p and q are joined by a curve on the manifold, then the fibers p and g 
can be identified. This identification is an isomorphism of vector spaces. Indeed, if 
sections s; and s2 satisfy Eq. (4.2), then so does any section of the form As; + ps2, 
where 2 and yu are constants. 
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Given any vector bundle p: E — M” over a manifold M”, the tangent space 
at each point e of the total space E contains a distinguished vertical space, that 
is, the tangent space to the fiber over the point x = p(e) € M”. In the general 
case, there is no distinguished horizontal space complementary to the vertical one. 
But a connection can be used to single out a horizontal space by means of parallel 
transport. For every vector X € TM", consider a curve y(t) determining it, for 
which y(0) = x and y’(0) = X. The section passing through a point e € E and 
parallel along the curve y is a curve ex(t) in the space EF. The tangent vectors at 
t = 0 to such curves generate a horizontal space at the point e. 

Using a connection, we can identify the fibers of a bundle over the endpoints p 
and g of some curve (the identification depends on the curve). In turn, the covariant 
derivative Vs can be expressed in terms of the parallel transport of a section along 
a curve as follows. First, we choose a curve y(t) for which y’(0) = X is the given 
vector. Let s(t) be the section s over the point y(t), and let s, be a section which is 
transported along the curve y and coincides with the section s over the point y (é). 
Then 


Vxs = lim S(O) — (0) 
E> 0 E 


(4.3) 


Here s, (0) is the parallel transport to the point y (0) of the section s over y (e). 

To prove (4.3), we choose a basis e1, ..., eg in the fiber of the bundle over 
some point of y and transport it along the entire curve. We denote the obtained 
sections over the points y(t) by e1(f), ..., eg(t). These sections form a basis over 
each point, and the section s can be expanded in this basis: s(t) = s! (t)e;(t). 
Over the point y(e) this expansion has the form s(¢) = s'(e)e;(e). Since the 
section s, is obtained by the transport along the curve of the section over y(é), it 
follows that the expansion coefficients are constant and the expansion has the form 
Se(t) = s'(e€)e;(t). Thus, se(0) = s‘(e)e; (0) and 


_ s6(0)—sO) .__ si(e) — s'(0) 
in A —_ = iim ————_— 
e>0 E ée>0 E 


ds! 
e(0) = det (0). 
Since the vector fields e;(t) are parallel along the curve y, we have V,/e;(t) = 0. 
Hence 


: ds' 
Vxs = Vyrs (t)e; (t) a = G7 Oa. 


This completes the proof of formula (4.3). 

Now consider the case where the bundle under consideration is a tangent bundle. 
Suppose given a connection V on a manifold M”. A curve y on M” is called a 
geodesic (with respect to the connection V) if Vyi(y’ ) = 0 for all points of y. 
In other words, a geodesic is determined by the condition that the velocity vector 
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is parallel along the curve y. In local coordinates a geodesic is determined by the 
system of differential equations 
d*xi j dx' dx* 
de dt dt 


=0, j=1,...,n. (4.4) 


A geodesic is determined by a system of n second-order differential equations. 
This system is equivalent to the following system of 2n first-order equations: 


dxi ' 

— = 0, fo ee (4.5) 
dvi pax dx* . 

—=Ti—:—, JH... 

dt ‘dt dt 


Therefore, for each point (x9, v9) € R2", we can choose an ¢ > 0 so that if a 
point (x1, v;) lies in the ¢-neighborhood of (x9, vo), then, for |t| < ¢, system (4.5) 
has a unique solution ¢ +» x(t) satisfying the initial conditions x(0) = x; and 
& ) = v1. This solution smoothly depends on x1, v1, and f. 

The geodesics have the following homogeneity property: if x(t) is a geodesic, 


dict so-ie.a (Ch) Indice ek) = ae, Then 2 = oe cad a 
: > y —_ : dt —_ dt dt ‘dt2* 


Therefore, if x(t) satisfies system (4.4), then y(f) satisfies this system as well. 

Let us draw a geodesic yy (t) for which dy (0) = V and yy (0) = x through a 
point x € M”. The geodesic yy(t) is uniquely determined for small t, but it may 
happen that it admits no extension to large ¢ values (if such an extension exists, then 
it is unique). If yy (t) is defined for t = 1, then we set exp, (V) = yy (1). 


Theorem 4.5 For any point x € M", there exists a neighborhood U of zero in 
T;M" such that the map exp, is defined on U and is a diffeomorphism from a 
neighborhood of zero in the tangent space at x to a neighborhood of the point x in 
the manifold M". 


Proof We have already proved that there exists an ¢ > 0 and a neighborhood U; of 
zero in T, M” such that yy (t) is defined for |t| < ¢ and V € U;. Let U2 be the image 
of U, under the contraction map V +> eV. Then, for each vector W = eV € Ud, 
the homogeneity property implies that yw (1) = yev(1) = yv(e) is defined. Thus, 
on U2 amap V +> exp,(V) is defined; this map is smooth. 

In the space 7M” consider the curve a(t) = tV, where V is a fixed vector. 
For t = 0, the velocity vector of this curve equals V. The map exp, takes this 
curve to the curve y,y(1) = yy (t). For t = 0, the velocity vector of yy (t) on the 
manifold M” equals V. Thus, the differential of exp, at the point 0 is the identity 
operator on the space 7, M”. Therefore, we can apply the inverse function theorem 
and find a neighborhood U of 0 in 7, M” such that the restriction of exp, to U isa 
diffeomorphism. Oo 
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4.9 The Curvature Tensor and the Torsion Tensor 


Before proceeding to discuss the curvature and torsion tensors, we give a general 
definition of a tensor. We begin with tensors on a vector space V. Under a change of 
basis é; = Al e; the coordinates of each vector change according to the contravariant 
law do! = (Aq hi v/, and the coordinates of each covector change according to the 


covariant law é; = A} e; covariant” means “varying in the same way as under 
direct change of basis,” and “contravariant” means “varying in the same way as 
under inverse change of basis”). The components of a tensor of type (p, g), where 
p is the number contravariant indices and q is the number of covariant ones, change 
according to the law 
Fi grt cache gdt ada itp 
a eae ip Ay ABT ig: 

A tensor of type (p, g) is an element of the tensor product of p copies of the space 
V and q copies of the dual space V*. 

A tensor bundle of type (p, g) over a manifold M” is a bundle whose fiber over a 
point x € M” is the tensor product of p copies of the space V = T, M” and qg copies 
of the dual space V*. For each coordinate neighborhood U C M” with coordinate 
functions x) ..., x”, the bases = of the space 7; M” and dx’ of the dual space 
determine bases in the fibers over the points of the neighborhood U. A tensor field 
is a section of some tensor bundle. The tensor field in a coordinate neighborhood U 
has the form 


ae ee) 0 j 
ij ...1p Q---® —@Qdx!'@---@dx". 


Jie dq Oxil Ox!P 


Under a change of coordinates the components of a tensor change as follows. Let 


TO); be the components of a tensor in local coordinates x!) ..., x”, and let 
TO) 6a be the components of the same tensor in local coordinates y!, ..., y”. 
Then 


it ..ip dy” ay Oxi axa 
fmda Qt” Byte By” ByPa 


1 ...Qp 


TY). . By = T(x) 


A tensor field is often briefly referred to simply as a tensor. 
Example 4.6 The Kronecker symbol 3 is a tensor of type (1, 1). 


Proof We must check that 


;  Ay® axd 
a = es 
BO = HOTT 
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where 54(y) = 64 and 6/ (x) = 87. Clearly, 


yO" aed _ayt at ag 
i ax! dyB Axi ayB FP 


oO 


A tensor field of type (1, 1) can be regarded as a linear operator on the tangent 
space at each point. Then the Kronecker symbol is the identity self-map of every 
tangent space. 


or or 


Example 4.7 The Riemannian metric gjj = (4. ant. 


1 


) on a hypersurface 
r(x°,...,x”) is a tensor of type (0, 2). 


Proof The required equation 


ax/) Ax? 
8B1,p.(Y) = Sih OTR ayB 
follows from 
or o ar ax 
dyB Axi ayB 


Oo 


We also need the notion of the commutator of vector fields on a manifold. In 
local coordinates the commutator [X, Y] of vector fields X and Y is defined in the 
same way as the commutator of vector fields in Euclidean space: 


An invariant definition of a commutator can be obtained by considering a vector 
field X as an operator dy which takes each function f to its derivative 0x f in the 
direction of a vector field X. Under this approach the commutator [X, Y] acts as the 
operator 0x dy — dy dx. 

Now we can define the curvature tensor and the torsion tensor for a connection 
V on a tangent bundle. The torsion tensor T of type (1, 2) and the curvature tensor 
R of type (1, 3) are defined at given vector fields X, Y, and Z by 


T(X, Y) = VxY¥ — Vy X — [X,Y], 


R(X, Y)Z = VxVyZ — VyVxZ — Vx, ¥1Z. 
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Let us check that T and R are indeed tensors of the specified types. We begin 
with the torsion tensor. For X = x'd; and Y = y/0;, we have 


VxY = x! y/V9,9; +x! (0y/)d; = x! yITF 9 +x! (d:y/)d;, 
VyX = x yi DE a + yl (8;x!)0;. 
Therefore, 
VxY¥ —VyX —[X, Y]= x'yi(h — Ti )oe- (4.6) 


Let T(0;, dj) = Th dx. Then Ti = ri, _ ri. By virtue of Theorem 4.4 on p. 155, 
the Christoffel caltcls ri. (x) aid mM 0) in local coordinates x and y are related 
by 


a i a J ayt a2 k 
roy rhoy iy, o2t det, Ot oat 
axk dy" dy’ axk dy"dys 


Therefore, the Christoffel pee themselves are not tensor components, but their 
differences Ts = =Tf - re ji We. 

A connection whose torsion identically vanishes is said to be symmetric. 
Problem 4.4 Prove that the torsion at a point p is zero if and only local coordinates 


in a neighborhood of p can be chosen so that ry. (p) = O for alli, j,k. 


The fact that the torsion is a tensor can be proved without using an explicit 
expression for the transformation of the Christoffel symbols under a change of 
coordinates. It follows directly from formula (4.6) that the torsion is bilinear, that 
is, T(fX,gY) = fgT (X,Y) for any functions f and g. Let us show that the 
bilinearity of the torsion implies that the torsion is a tensor. Under a change of 
local coordinates the coordinates of vectors and the coordinate vectors themselves 


vary according to the laws u! (x) = u “(y) ea and 0;(x) = dg(y) or Therefore, 


u* (yu? (y) Tig) (y) = oe 
=u “oe Ones a ST (x) Ap) 


and hence 


ax! ax! ay” 
T, = Tj —_——S =: 
ap) = Tj) a ayP axk 
as required. 
We have already performed all needed calculations for the curvature tensor in the 
proof of Theorem 3.15 on p. 99; in that proof the trilinearity of the curvature was 
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shown, which implies that the curvature components R jk Can be determined from 
the equation 


R(X Aj, Vag) (Z' Aj) = XIV*Z' Ri, A. 


The trilinearity of the curvature implies that the curvature is a tensor (the argument 
is precisely the same as in the proof that the bilinearity of the torsion implies that 
the torsion is a tensor). 

A simple calculation shows that 


(4.7) 


The only difference between this calculation and that in the previous section is that 
previously the Christoffel symbols were symmetric and the order of subscripts did 
not matter, while now this order must be taken into account. 

It is seen directly from the definition of the tensors T and R that T(X, Y) = 
—T(Y, X) and R(X, Y)Z = —R(Y, X)Z. For the components of the tensors T and 
R, this means that Ti; = —Tj, and Rij = — Rig; In the case of a connection with 
zero torsion, there also arise some other relations between the components of the 
curvature tensor. 


Theorem 4.6 (First Bianchi Identity) /fT = 0, then 


R(X, Y)Z + R(Z,X)¥ + RW, Z)X =0, i.e, Rigg + Ruj + Rei =0. 
First Proof Choose local coordinates in a neighborhood of a given point so that all 
Christoffel symbols vanish at this point (for a torsion-free connection, Problem 4.4 
can be used). Then relation (4.7) takes the form 


R= aT = oy 
ik Axi axk 
Th po ar, art, po. ary, rh . 
erefore, Rij = yar = ger and Rix ge aor Summing these three 
equations and again using the symmetry of the Christoffel symbols, we obtain the 
required identity. Oo 


Second Proof We must prove that if Vy V — VyU = [U, V] for any vector fields 
U and V, then the sum of the three expressions 


Vy VxZ — VxVyZ — Viy,x]Z, 


VzVyX — VyVzX — Viz,y)X, 


Vx VzZY —VzVxY — Vix,z\Y 
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vanishes. First, note that 


Vx(VzZY — VyZ)+ Vy(VxZ — VzX) + Vz(VyX — VxY) 
= Vx[Z, Y] + VyLX, Z] + Vz[Y, X]. 


Next, 


VxIZ, Y] — Viz,yjX + VyLX, Z] — Vix,z)¥ + VzlY, X)] — Viy,x1Z 
= [X,[Z, YJ] +[Y, LX, Z]] + [Z, LY, X]] =0 


by virtue of the Jacobi identity (see Appendix, Theorem 8.3). Oo 


HISTORICAL COMMENT In 1902 Luigi Bianchi (1856-1928) proved two identities 
for the Riemann tensor. The second Bianchi identity is discussed on p. 166. 


4.10 The Curvature Matrix of a Connection 


Recall that a connection V on a bundle é is a map which takes every pair consisting 
of a vector field X on a manifold M” and a section s of a bundle & to a section 
Vx(s) of the bundle & so that this map is linear in X and s and, for any function f 
on M”", the equation V;x(s) = fVx(s) and the Leibniz’ rule 


Vx (fs) = fVx(s) + Ox f)s 


hold. 

A connection V can be regarded as a map V: r(é) — T(T* @ &), where ['(é) 
is the space of sections of the bundle € and 7* is the cotangent bundle of the 
manifold M”, i.e., the space of 1-forms. Indeed, note that, given a vector field X 
and a tensor Vs = w! @ si, we can define the section w! (X)s;. Therefore, given 
a map V: r(é) — T(T* @ &), we can assign the section w! (X)s; to each vector 
field X and any section s. Clearly, this correspondence is linear in X, and for any 
function f on M”, we have w! (f X) = fa! (X). For the map (X, 5) bk w! (X)s; to 
be a connection, it is required that the map V have the properties 


V(s1 +52) = Vs1 + Vso, 
V(fs) = fVst+df @s. 


The first property means that the correspondence is linear in s, and the second one 
ensures the fulfillment of Leibniz’ rule: it suffices to note that 0x f = (df)(X). 
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Now suppose given a connection V assigning a section Vys to any vector field 
X and section s. Choose local coordinates x!, ..., x” on the manifold M” and 


consider the vector fields e; = a in these local coordinates. We set Vs = w! @ Si, 


where w! (e p= 5 and s; = V-.,s. Let us check that Vs does not depend on the 
choice of local coordinates. Consider other local coordinates 9, ..., y” in the same 
neighborhood. Let ex = 5% = Ajei, and let 6'(e;) = 54. Then w! = A;,6*. Thus, 
we must show that 


AXO* @ Ve, =O @ Vai.,- 


This follows from the linearity of Vys in X (with respect to multiplication by 
functions rather than constants). 

We introduced the notation V just not to get lost in the proof. In what follows, 
we will use the same symbol for the assignment of a section to a vector field X and 
a section s and for the corresponding map '(é) > P'(T* @ &). 

A connection on a trivial bundle over a neighborhood U in R” is determined by 
a matrix w = (w! ), whose entries serve as differential 1-forms on U. Indeed, let e1, 
..., €q be the basis sections of a bundle &, and let e be the column with components 


€1,..., €g. Then Ve; = wo; ® ej, so that Ve = w @ e. The matrix is called the 
connection matrix. This matrix is related to the Christoffel symbols as T° i‘ — wh (€;), 
where €1,..., &, is the basis of R”. Indeed, , 


k k k 
Dyjpek = Vevej = Vej(Ei) = (@; @ ex) (Ei) = 0; (Ei) ek. 
Thus, a connection can be defined by a connection matrix. Let e’ = Ae be another 
set of sections, where A is a square matrix whose entries are functions A‘, and let 
w’ be the corresponding connection matrix. Then 


w =dA-A~!+AoAq!. (4.8) 


Indeed, in view of the properties of a connection, we have V(Ae) = AVe+dA @e. 
On the other hand, V(Ae) = Ve’ = a! @ e’. Therefore, 


ow @e' = AVe+dA@e = Aw@A'e' +dA@A!e! = (AWA7!4+dA-A7')@e’. 

Now let us express the curvature tensor in terms of the connection matrix. 
This expression contains the differentials of the 1-forms w'. We use the following 
expression for the differential of a 1-form w: , 


dw(X, Y) = dx(@(Y)) — dy (@(X)) — a(LX, Y)) 


(see Appendix, Theorem 8.4). We will also use the relation Vxe; = wo! (X)e;, which 


follows directly from the equation Ve; = wo! @ ej. 
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Clearly, 
Vx Vye; = Vxe! (Ye; = ax (w) (Y))ej +o! (Y)Vxe; 
= dx (wo! (Y))ej + of (Yok (Xe 
= dx(w! (Y))e; + oF (Ya) (X)e;. 
Therefore, 


R(X, Ye; = (VxVy — Vy Vx — Vixyeéi 
= (ax (w) (Y)) — ay @/ (X)) 
— w) (X,Y) — of (X)o@jJ (Y) + of (Y)a] (X))e;- 
Here 
of (Xo) (Y) + of Yo) (X) = —(0(Xo(¥) — oP 0(X)). 


To show that matrices whose entries have super- and subscripts are indeed multiplied 
by this rule, consider two basis changes, e’ = Ae (e, = Ajej) and e” = Be’ 
(e/ = Bke'). The transformation e” = Be’ = BAe for the matrix entries is written 
ase! = Bre! = BK Ale;,ie., (BA)} = BAI. 

The expression w(X)w(Y) — w(Y)w(X) can be denoted by w A w(X, Y). Then 
R(X, Y)e; = 2) (X, Y)e;, where Q) = da! —(w Aa)!. 
Remark For p > 1,a pairing between the spaces A? V and A? V* can be defined in 
different ways. If the pairing is assumed to originate from the pairing between the 
dual spaces V @--- @ V and V* @--- @ V%, then it has the form 


1 [@LG) ++. ep) 
(M1 A-+-A@p)(QX1 A+++ AXp) = Sill, aeons 


‘ |ai(%p) ... @p(Xp) 
But usually the pairing is defined by 


1 (X1) ... Mp(X1) 
(M1 A+++ A@p)A1L A+ + AXp) =| wee eee 


M1 (Xp)... Op(Xp) 
This corresponds to the pairing between the spaces of tensors multiplied by p!. 


The matrix Q = dw — w A ow of 2-forms is called the curvature matrix of 
the connection. Let us check that, choosing another set of sections e’ = Ae, we 


166 4 Hypersurfaces in R’+!: Connections 


obtain the matrix Q’ = AQA7!, First, note that dA~'=—-A7'dA. Au!, because 
d(A~!A) =0. Equation (4.8) can be written in the form 


wWA=dA+Ao. 


Let us differentiate this equation, having in mind that w’ is a 1-form and A is a 
0-form: 


dw’-A-w' AdA=dAdAw+ Ado. 
Therefore, 


dw! = (a A\dA+dAAw+ Adw)A~! 
=dA-A 'adA-A'!+AwA !AdA-A'+dAA@A! + AdwA!. 


It is also clear that 


ow! Aw = (dA-A7!+ AwA7!) A (dA- AT! + AwA7!) 
=dA-A~!AdA-A!+AwA7! AdA- Aq! 
+dA-A7~!/” AwA!+ AwA! A AwA™!, 


Thus, 
Q' = do! — a! Aw! = AdwA — Aw A\wA! = AQAM!, 


Theorem 4.7 (Second Bianchi Identity) dQ =wAQ-QAwo. 
Proof Clearly, 


dQ = d(dw-—@Aw) 
= —-dw\o+toddw 
= (-Q-AA0)AO+ 0A (Q+ 0A 0) 


=H=WAQ-L Aw. 


Oo 


Now consider a connection V on the tangent bundle of a manifold M”. Let e1, 
..., €n be vector fields on an open set U C M” linearly independent at each point. 
For these vector fields, we can consider the dual 1-forms w!, ..., w”, for which 


a (e p= 5, Let us calculate the differentials of these 1-forms. For this purpose, we 
need the 1-forms w! being the entries of the connection matrix. Recall that they are 


determined from the relation Vxye; = ow! (X)e;. 
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For a connection on the tangent bundle, we defined the covariant derivative of a 
1-form as (see p. 156) 


(Vx@)(Y) = 0x(@(Y)) — @(VxY). 
For covariant derivatives thus defined, we have 
dw(X, Y) = dx(@(Y)) — dy(@(X)) — a([X, Y]) 


= (Vx@)(Y) — (Vy@)(X) + o(VxY — VyX —[X, Y]) 
= (Vx@)(Y) — (Vya)(X) — o(T(X, Y)), 


where 7 is the torsion tensor of the connection V. 

The definition of the covariant derivative of a 1-form implies 0 = dx (w! (ej)) = 
(Vxa!)(e;) + o! (Vxe;). We also have w! (Vxe;) = a! (w§ (X)ex) = wh (X34 = 
w, (X) = w) (X)o* (e;). Therefore, 

0 = (Vxa')(e;) +o! (Vxej) = (Vxa! + 0} (X)o*)e;, 


and hence Vyw! = —a}, (X)o*. This implies 


doy! (X,Y) — (w A wj)(X,Y) 
= dx (w! (Y)) — dy (w) (X)) — w! (X,Y) = (wo (Xa (Y) — oF (Y)@j (X)) 
= dx(w) (Y)) + of (Y)aj (X) — ay (w! (X)) — of (X)w] (Y) — o! (LX, YD) 
= Ax (w! (Y)) — (Vxw! )(Y) — dy (w! (X)) + (Vyo!)(X) — wo! ([X, ¥]) 


= w! (Vx¥ — VyX —[X, Y]) =o/ (T(X,Y)). 


Thus, dw! = a* a oj; +o/(T).In particular, for a torsion-free connection, we have 


doi = ak A wy. 


4.11 Solutions of Problems 


4.1 Let us prove that parallel transport along a curve preserves inner product. Let 
V(t) and W(t) be vector fields parallel along a given curve. Then 


d _ (dvit) dWit)\ _ 
vn. Wo) = ( oF wo) + (vio. O) =o, 


avi) sal aw 


because the vectors are orthogonal to all tangent vectors. 
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4.2 Summing the equations Vi(fs) = fVy(s) + (ax f)s and Ve(fs) = 
FVe (s) + (Ox f)s multiplied by A and ww, we obtain Vx(fs) = fVx(s) + A+ 
1) (x f)s, where V = AV! + wV?. Therefore, V obeys Leibniz’ rule if and only if 
A+ w& = 1. The equation V ¢x(s) = fVx(s) holds for any 4 and pw. 


4.3 Let us cover M” by charts over which the bundle is trivial. On a trivial bundle 
over each of these charts we construct a connection as in Example 4.5. We take 
a partition of unity subordinate to the cover by charts and apply it to glue the 
constructed connections together into a single connection. This can be done by using 
Problem 4.2, or, to be more precise, its obvious generalization. 


4.4 If in some system of local coordinates we have Ti, (p) = 0 for alli, 7, k, then 
all components of the torsion tensor vanish at p in these coordinates, and therefore 
all components of the torsion tensor vanish at the point p in ay coordinate system. 

Now suppose that the torsion vanishes at a point p, i.e., re jf p~—= Ti, (p) for all 
i, j, k in any coordinate system. In a neighborhood of p we introduce two ‘coordinate 
systems x and y and denote the Christoffel symbols in these coordinates by ry. and 


6 ‘ These Christoffel symbols are related by 
, ay ax! axi dy’ 2x 


a 
Ps T ask" ayF yh * Bak Byraye 


Let us show that in a sufficiently small neighborhood of p the change of 
coordinates x — y can be performed in such a way that the coordinates of p are 
zero in both coordinate systems and the coordinates of any other point q are related 
by 


x¥(q) = y¥(q) — ti ‘(p)y' (@y! @): 


Indeed, it follows from the ee of the Christoffel symbols that 2 aq) = = 
5k _ ry k (p)yd (q); therefore, 2— yr = bk, Thus, at the point p the Jacobi ace of 


the map (y!(q), Ls VG) (x!(q), ...,X"(q)) is the identity matrix, and any 
such map is locally invertible. It is also clear that 


a2x* 


_yp* 
Dray <a PP) = TP), 


whence it follows that 
Ty5(p) = Vi (p)8z553 — 506, (p) = 0 ;,(p) — P4,(p) = 0 


for all r, s, and ¢. 


Chapter 5 ® 
Riemannian Manifolds Cheek for 


A Riemannian manifold M” is a smooth manifold equipped with an inner product 
(Riemannian metric) g(X, Y) in the tangent space at each point; moreover, this inner 
product is smooth in the sense that g(X, Y) is a smooth function on M” for any 
smooth vector fields X and Y on M”. 

In topology manifolds are considered up to a diffeomorphism. In differential 
geometry Riemannian manifolds are considered up to an isometry, that is, a 
diffeomorphism which transforms the Riemannian metric of one manifold into the 
Riemannian metric of the other. 


5.1 Levi-Civita Connection 


A connection V on the tangent bundle TM” is said to be compatible with the metric 
if 


axg(Y, Z) = g(VxY, Z) + g(¥, VxZ). 


If a connection V is compatible with a Riemannian metric g, then it preserves 
the inner products of vectors parallel along any curve. Indeed, suppose that vector 
fields Y and Z are parallel along a curve and X is a vector tangent to this curve at 
some point. Then Vx Y = 0 and Vx Z = 0, whence dx g(Y, Z) = 0. 

Recall that a connection V is torsion-free (has zero torsion) if 


Ve¥ — Vy X¥ = [X, ¥1. 


Theorem 5.1 (Levi-Civita) On a Riemannian manifold there exists a unique 
torsion-free connection compatible with the metric. 
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Proof For brevity, we introduce the notation (X, Y) = g(X, Y). If a connection is 
compatible with the metric and has zero torsion, then 
(VxY, Z) = dx(Y, Z) — (Y, VxZ) 
= 0x(Y, Z) — (Y, VzX) — (Y, [X, Z]) 
= dx(Y, Z) — 0z(Y, X) + (WzY, X) — (Y, [X, Z]) 
= 0x (Y, Z) — 0z(Y, X) + (VyZ, X) + ((Z, Y], X) — (Y, LX, Z]) 
= 0x(Y, Z) — 0z(Y, X) + dy(Z, X) — (Z, VyX) 
+ ((Z, Y], X) — (Y, [X, Z)) 
= Ox (Y, Z) — 0z(Y, X) + dy(Z, X) — (Z, VxY) 
— (Z,[Y, X]) + (Z, VY], X) — (Y, [X, Z)). 


As a result, reducing this expression to a more symmetric form, we obtain 


2(VxY, Z) = dx (Y, Z) + dy (X, Z) — 0z(X, Y)+ (5.1) 
+ ([X, Y], Z) — (X, Z], Y) — (CY, Z], X). 


Since inner product is nondegenerate, knowing the inner products (Vy Y, Z) for 
all Z, we can calculate Vx Y. Therefore, using (5.1), we can express Vx Y in terms 
of the Riemannian metric. Conversely, a straightforward but fairly long calculation 
(see the solution of Problem 5.1) shows that this formula indeed determines the 
vector Vx Y, and the operation V defined by this formula is a torsion-free connection 
compatible with the Riemannian metric. Oo 


HISTORICAL COMMENT Theorem 5.1 was proved in 1929 by Levi-Civita. For- 
mula (5.1) was obtained in 1950 by Koszul. It is known as Koszul’s formula. 


Problem 5.1 Prove that formula (5.1) determines a torsion-free connection com- 
patible with the Riemannian metric. 


The connection in Theorem 5.1 is called the Levi-Civita connection. 

Formula (5.1) makes it possible to express the Christoffel symbols of the Levi- 
Civita connection in terms of the metric. For this purpose, we choose a local chart 
and identify it with a domain in R”. As the vector fields X, Y, and Z we take 
the coordinate vector fields e;, e;, and ex in IR". Taking into account the equation 
Vejej = Ti ,e1, we obtain 


Ogik  Ogik “tt ) (5.2) 


1 
in 1 -_ . ene ‘ _ 
gxilj; = (Vj; ek) = (Vee; ek) = 5) (#4 + ax; crs 


where gij = g(éj, ej). 
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5.2 Symmetries of the Riemann Tensor 


The curvature tensor of the Levi-Civita connection is often called the Riemann 

tensor. Applying the superscript lowering operation to the Riemann tensor, we can 
i Keen P : 

pass from the tensor R; ik to the tensor Ryjjx = gipR; ik? for which the symmetry 

properties have simpler form. 


Theorem 5.2 The tensor Rjjjk satisfies the following identities: 


(@) Rujk = —Riikj; 

(b) Rujk + Riij + Rijri = 9; 
(c) Rijk = —Ritjr: 

(d) Rijk = R jki- 


Proof Recall that it follows directly from the definition of the curvature tensor that 
RI ik = —R' ,;- Lowering the superscript, we obtain identity (a). Identity (b) follows 
from the first Bianchi identity (see Theorem 4.6 on p. 162). Indeed, multiplying 


the identity Rig + Ri. eae Rigi = 0 by gj» for each p and summing the resulting 
identities, we obtain (b). , 


Let us prove identity (c). First, we show that it is equivalent to the identity 
(R(X, Y)Z, W) + (R(X, Y)W, Z) = 0. (5.3) 
For X = x/d;, Y = Yea, Z = Z'd;, and W = W'd, we have 
(R(X, Y)Z, W) = (R(X/9;, Y*de)(Z'9;), W' a1) 
= (XIY*Z' RE ay, W'A) 
= X!¥*Z' We Rig = XIY"Z' W! Risin, 
(R(X, Y)W, Z) = XI YF WZ! Ruiig = XIY*Z'W! Rinjc. 
In turn, identity (5.3) is equivalent to 
(R(X, Y)Z, Z) = 0. (5.4) 
Indeed, (5.4) implies 


O= (R(X, Y)(Z+ W),Z+W) 
= (R(X, Y)Z, Z) + (R(X, Y)W, W) + (R(X, Y)Z, W) + (R(X, Y)W, Z) 
= (R(X, Y)Z, W) + (R(X, Y)W, Z). 
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The vector R(X, Y)Z ata point x € M” depends only on the vectors X (x), Y (x), 
and Z(x); therefore, it suffices to prove (5.4) in the case where [X, Y] = 0. In this 
case, we have 


R(X, Y)Z = VxVy Z — Vy VxZ. 


The connection V is compatible with the metric; therefore, 0x(Z, Z) = 2(VxZ, Z), 
whence 


dx dy (Z, Z) = 2(VxVyZ, Z) +2(Vy Z, VxZ). 
Similarly, 
dy dx(Z, Z) = 2(Vy Vx Z, Z) + 2(VxZ, VyZ). 


Since the vector fields X and Y commute, it follows that dydy(Z,Z) = 
dy dx(Z, Z), and hence 


(VxVyZ, Z) = (VyVxZ, Z), 


as required. 
Identity (d) follows directly from identities (a), (b), and (c). Indeed, let us write 
identity (b) in four ways: 
Riijk + Rikig + Rijxi = 9, 
—Rijki — Rijk — Rikij = 9, 
Rye — Ryikt — Rjuik =9, 
Rug + Raji + Reijt = O. 


Summing these four identities and using (a) and (c), we obtain 
2Riijk —2Rjmi =O, Le, Rije = Rjii, 


as required. Oo 


Corollary In dimension 2 all nonzero components of the tensor Rjjjx are equal to 
R212. 
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5.3. Geodesics on Riemannian Manifolds 


For the Levi-Civita connection on a Riemannian manifold M”, geodesics have 
various special properties caused by the presence of the Riemannian metric and the 
symmetry of the connection. For example, for a geodesic on a Riemannian manifold, 
the length of the velocity vector is constant. Indeed, we have 


d 
a y') = UVyry', y’) = 0. 


Theorem 5.3 (Gauss’ Lemma) For any point x © M"” and any nonzero 
vector Vo € T,M", the curve exp,(tVo) is orthogonal to the hypersurface 
{exp, (V) | || Vl] = const}. 


Proof Let V(s) be a curve on the sphere ||V|| = const. We set f(s,t) = 
exp, (tV(s)). It is required to prove that ( a. it) = 0 fort = 1. Clearly, 
of @a C) 
af af) _(,, af af) , (af g, a 
at at’ as a Of Os at’ as 


a x= = 0. Since the coordinate 


vector fields Ti and 2 jy commute, it follows pea the symmetry of the connection 


The curve oe eS) is geodesic, whence V 


that 
0 1 
af 9%) _ (8,9, 4) 18 (af wf) 9 
ar’ as ar’ a ar) 2as ar’ ar 
because of is the velocity vector of the geodesic. Thus, 2 oe, st) = = 0, and hence 


(¢. it) does not depend on f. It remains to check that (ir x) = = Ofort = 0. 


But f(s, 0) = exp, (0) = x is a constant point. Therefore, ois, 0) = 0. oO 


A connected Riemannian manifold M” can be endowed with the structure of a 
metric space as follows. Given a smooth curve y : [a, b] — M”, we define its length 


(vy) by 


b 
i= / ly'@llat, 


where ||y’(t)|| is the length of the velocity vector of this curve (this vector lies in the 
tangent space, and the length of a tangent vector is determined by the Riemannian 
metric). The length of a piecewise smooth curve equals the sum of lengths of the 


174 5 Riemannian Manifolds 


smooth curves from which it consists. For points x, y € M", the distance d(x, y) is 
defined as 


d(x, y)= its 


where the greatest lower bound is taken over all piecewise smooth curves joining x 
and y. 


Problem 5.2. Check that that d(x, y) has all properties of a metric. 


Problem 5.3. Check that the topology induced by the metric d(x, y) coincides with 
the standard topology of the manifold. 


Problem 5.4 Is it true that any two points x and y of a connected Riemannian 
manifold can be joined by a smooth curve y for which d(x, y) = /(y)? 


Theorem 5.4 Suppose that points x and y are joined by a smooth curve y for which 
d(x, y) = L(y). Then y is a geodesic (to be more precise, y becomes a geodesic 
after a change of parameter). 


Proof It suffices to prove that any point z of the curve y has a neighborhood inside 
which the arc of y that contains z is a geodesic. Let D be an open ball in T,M” 
centered at the origin and such that the restriction of exp, to D is a diffeomorphism, 
and let U = exp, D. Oo 


Lemma 5.1 Let w: [a,b] — U be a smooth curve represented in the form w(t) = 
exp, (r(t)V (0), where r(t) is a number and V(t) is a vector of length 1. Then 
I(@) > |r(a) — r(b)|, and the equality is attained if and only if the function r is 
monotone and the function V is constant. 


Proof We set f(r, t) = exp,(rV(t)). Then w(t) = f(r(t), t), whence 


do) _ af. , af 
de ar ot 


According to Gauss’ lemma, the curve f(r, fo) is orthogonal to f (ro, t); therefore, 


the vectors or and of are orthogonal. The curve exp,(rV(to)) is geodesic and has 


velocity vector V (fo) at the point z; hence [7 


> IMP, 


dw(t) || 
dt 


ral? + | 


which means that 


I(w) = [ 


a“ 


jar = [ rt) dt > |r(a) —r(d)I, 
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and the equality is attained only if or = 0,1.e., V(t) = const and the function r is 
monotone. oO 


In the case where r is smaller than the radius of the ball D, we say that the 
hypersurface {exp,(V)| || V || =r} is a geodesic sphere of radius r centered at z. 

Lemma 5.1 can be formulated as follows: the length of any curve in U joining 
geodesic spheres of radii r; and r2 centered at z is at least |r; — r2|. Let y; be a 
part of the curve y which is entirely contained in U and joins the point z with some 
point z’ = exp,(rV), where ||V || = 1. For each ¢ € (0,7), the curve y; contains a 
curve joining concentric geodesic spheres of radii ¢ and r; therefore, /(y,) > r—. 
For ¢ — 0, we have /(y,) > r. The equality is attained only in the case where 7 is 
a geodesic. 


Corollary Suppose that points x and y are joined by a piecewise smooth curve y 
(consisting of finitely many smooth links) for which d(x, y) = l(yv). Then the curve 
y is geodesic (in particular, it is smooth). 


Proof It is sufficient to check that Lemma 5.1 remains valid for a piecewise smooth 
curve w. Such a curve consists of smooth arcs joining concentric geodesic spheres 
of radii r}, r2, ..., rg. Its length equals |r; — rx| if and only if the sequence ry, r2, 
..-,/k iS monotone and the function V (ft) is constant. oO 


Problem 5.5 Prove that any geodesic on the sphere S” is a great circle or its part. 
(A great circle is the intersection of the sphere with a plane through its center.) 


5.4 The Hopf—Rinow Theorem 


A Riemannian manifold M” is said to be geodesically complete if, for any point 
x € M” and any vector V € 7,M”, the point exp,(V) is defined, or, in other 
words, if any geodesic can be extended without bound. The Riemannian metric on a 
geodesically complete Riemannian manifold is said to be geodesically complete as 
well. 


Example 5.1 The Riemannian manifold R? \ {0} is not geodesically complete (the 
geodesic going to the point 0 cannot be extended beyond this point). 


Example 5.2 The Riemannian manifold R! x S! is geodesically complete (any 
geodesic on a cylinder can be extended without bound). 


Theorem 5.5 (Hopf—Rinow [Ho]) On a connected geodesically complete Rie- 
mannian manifold M" any two points x and y can be joined by a shortest geodesic. 


Proof [Rh] We begin with producing a geodesic which is a likely candidate, and 
then we prove that this is indeed the required geodesic. 

Let d = d(x, y). Consider a geodesic sphere S of sufficiently small radius r 
centered at x. The set S is compact; hence there exists a point z € S for which 


176 5 Riemannian Manifolds 


Fig. 5.1 The choice of the 
point z’ 


a a 
vw 


d(z, y) = Minyes d(w, y). Let z = exp, (eV), ||V|| = 1. We want to prove that the 
curve y(t) = exp,(tV), ¢t € [0, d], is the required geodesic joining points x and y 
(obviously, it has length d). 

We will prove that the distance between points y(t) and y equals d — t for each 
t € [r, d]; then, fort = d, we will have y(d) = y. First, we prove this fort = r. Any 
curve joining the points x and y intersects the geodesic sphere S in some point w. 
Therefore, 


Re 
R 


d=d(x, y) =min (d(x, w)+d(w, y)) =r+mind(w, y)=r+d(z,y). 
wes wes 


Thus, d(z, y) = d — r, whence z = y (r). Let to be the least upper bound for those 
t € [r, d] at which the required equality holds. Then, by continuity, it also holds at 
to. Suppose that tg < d. Let S’ be a geodesic sphere of sufficiently small radius r’ 
centered at y (fo). Choose a point z’ of S’ for which d(z’, y) = minyes d(w’, y) 
(see Fig. 5.1). We have 


d~ 19 =d(y(o), y) = min (d(y(o), w') + aw’, y)) =r" +d@, »). 
Therefore, 


d(z',y)=d—-(t +r’). (5.5) 


We claim that z’ = y(to +r’). Indeed, on the one hand, by the triangle inequality, 
we have 


d(x,z)>d(x,y)—d(y,z)=to +1’. 
On the other hand, if we go first from x to y(t) along the geodesic y and then 
from y (to) to z’ along a shortest geodesic, then we will obtain precisely a path of 


length to + r’. This path has minimum length; therefore, according to the corollary 
of Theorem 5.4, it is geodesic. Thus, z’ = y(t +r’), and (5.5) implies 


d(y(t +r’), y)=d-(o+r), 


i.e., the required equality also holds for fg + r’ > to. This contradiction proves 
that to = d. oO 
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The manifolds in Examples 5.1 and 5.2 are homeomorphic. Thus, these examples 
show that the notion of geodesic completeness is not topological: it depends on 
the choice of a Riemannian metric. But the geodesic completeness of a connected 
Riemannian manifold is equivalent with the completeness of this Riemannian 
manifold treated as a metric space. Before proving this, we recall the definition 
of a complete metric space. 

Let X be a metric space with metric d. A sequence of points x1, x2, ...of X 
is called a Cauchy sequence if, for any e > 0, we can choose a number N so that 
d(Xn, Xm) < € forallm,n > N.A metric space is said to be complete if any Cauchy 
sequence has a limit point in this space. 


Theorem 5.6 Suppose that any closed bounded subset of a metric space X is 
compact. Then X is a complete metric space. 


Proof Let {x;} be a Cauchy sequence in X. Then, for each ¢ > 0, we can 
choose an N so that d(xy,X%m) < € forn,m > N. Consider the closed subsets 
Ci = {xj, Xi41,...} of X. The closed set C = Cj is bounded, because d(x, xj) < 
d(x1,Xn41) +d(xn41, Xi) < d(x1, Xn41) + fori > N. Therefore, C is compact. 

Suppose that the open sets U; = C \ C; cover C, i.e., VC; = @. Then the cover 
of the compact set C by these open sets contains a finite subcover, and hence some 
finite intersection of C; is empty, which is not true. Therefore, VC; contains a point 
x. We have d(x, x) < ¢ forn > N, because x € C,. Hence x is a limit point of the 
Cauchy sequence {x;}. oO 


The Hopf—Rinow theorem readily implies the following completeness criterion 
for a Riemannian manifold (this criterion was also proved by Hopf and Rinow [Ho]). 


Theorem 5.7 A connected Riemannian manifold M” is complete as a metric space 
if and only if it is geodesically complete. 


Proof First, suppose that M” is geodesically complete. Let us show that any closed 
bounded set C C M” is compact. Suppose that the distance from a point x € C to 
any other point of the set C is at most d. Take a closed ball of radius d in TM” 
and consider its image B under the map exp,. The set B is compact. Moreover, 
according to the Hopf—Rinow theorem, it contains C. Thus, C is a closed subset 
of the compact space B, and hence it is compact. Theorem 5.6 implies that M” is 
complete as a metric space. 

Now suppose that M” is complete as a metric space. We must show that, given 
any point x €¢ M” and any vector V € T;M”", the curve y(t) = exp,(tV) is defined 
for all t > 0. Suppose that the least upper bound for those ¢ at which y(t) is defined 
equals ~@ < oo. Choose a sequence tz € (0, w) converging to a as k — oo. Clearly, 
tz, is a Cauchy sequence. Therefore, so is the sequence y (t,), because 


d(y (te), ¥Ctm)) SIV Mtte.tm1) = VIL Ite = tn | 


Thus, the sequence y(t) converges to some point y ¢ M”. Let U be a bounded 
neighborhood of y, and let U be its closure. Since the metric space M” is complete, 
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if follows that U is compact. In the restriction of the tangent bundle to U consider 
the set of vectors of length || V ||. If the neighborhood U is small enough, then this set 
is the direct product of two compact subsets U and $”~!, and hence it is compaet 
Thus, the sequence f; has a subsequence f;, for which the sequence of vectors y’ (tg; ) 
converges to a vector W € T,M”. The velocity vector of the curve exp,(tW) at 
= 0 is W. The curve y(t + @) contains a sequence of points such that this 
sequence itself converges to y and the velocity vectors at the points of the sequence 
converge to W. Both curves are geodesic, so that it follows from the uniqueness of 
a geodesic that y(t +a) = exp, (tW) for small negative t. But exp, (T W) is also 
defined for small positive t; therefore, the geodesic y(t) is defined fort = t + a, 
provided that t is a small positive number. We have arrived at a contradiction. O 


Remark Looking carefully at the proofs of the Hopf—Rinow theorems, we see that, 
to ensure the geodesic completeness of a connected manifold M”, it is not necessary 
to require that any geodesic can be extended without bound; it is sufficient that 
any geodesic through some fixed point can be extended without bound. Indeed, 
according to Problem 5.6, any closed bounded subset of such a manifold M” is 
compact, so that M” is a complete metric space (by Theorem 5.6), which means 
that this manifold is geodesically complete. 


Problem 5.6 Suppose that any geodesic from some fixed point x in a connected 
Riemannian manifold M” can be extended without bound. Prove that 
(a) the point x can be joined to any other point y € M” by a minimal geodesic; 
(b) any closed bounded subset of M” is compact. 


HISTORICAL COMMENT Heinz Hopf (1894-1971) and his student Willi Rinow 
(1907-1979) proved Theorems 5.5 and 5.7 in 1931. The constructive proof of 
Theorem 5.5 given above was proposed by Georges de Rham (1903-1990) in 1952. 


5.5 The Existence of Complete Riemannian Metrics 


Two Riemannian metrics on a manifold M” are said to be conformally equivalent 
if, for each point x e€ M"”, the inner products in the tangent space 7, M” 
corresponding to these Riemannian metrics are proportional (the proportionality 
coefficient depends only on the point x). 


Theorem 5.8 (Nomizu—Ozeki [No]) For any Riemannian metric g on a connected 
manifold M", there exists a geodesically complete Riemannian metric g' confor- 
mally equivalent to g. 


Proof For each point x € M”, we define r(x) as the least upper bound for those 
numbers r for which the closure of the set D., = {y € M"|d(x,y) < r} is 
compact. If r(xo) = oo for some point xg, then M” is compact and hence the metric 
g is complete. In what follows, we assume that r(x) < oo for all x. 
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Let us show that the function r(x) is continuous. To this end, it suffices to check 
that |r(x) —rQy)| < d(x, y). Ifz € D,,,, then d(x, z) < r and d(y, z) < d(y,x)+ 
d(x,z) < d(y,x)+r;hence z € Dy -+a(x,y). Thus, Dy, C Dy ++a(x,y)- Therefore, 
ifr + d(x, y) < r(y), then the closure of Dy ++a(x,y) is compact, so that the closure 
of D,,, is compact as well, i.e., 7 < r(x). It follows that r(x) > r(y) — d(x, y). The 
proof of the inequality r(y) > r(x) — d(x, y) is similar. 

Let w(x) be a smooth function such that w(x) > 1/r(x) forall x € M”. Then the 
Riemannian manifold M” with metric g’ = (o(x))"g is geodesically complete. To 
prove this, it suffices to show that D 13 = {y € M"|d'(x, y) < 1/3} is contained 
in Dy r(x)/2. Indeed, then the closure of D. 1B is compact, and hence any geodesic 
can be extended by a distance of 1/3, which means that any geodesic can be extend 
without bound. 

Suppose that d(x, y) > r(x)/2. Let y(t),a < t < b, bea piecewie smooth 
curve joining the points x and y. Its length in the metric g equals L = ri | = | dt. 
Clearly, L > d(x, y) => r(x)/2. The length of the same curve in the metric 9’ ‘equals 


b d b 
ee i o(y(t)) IF) dt = (vc) f 


for some fo € [a,b]. Therefore, L’ = o(y (to))L > ca” Next, Ir (vy (to)) _ 
r(y (to) 


dt = w(y (to))L 


r(x)| < d(x, x9) < L, whence r(y (to)) < r(x) + L. It follows that L’ > OTE: 
But L > r(x)/2, so that L’ > 1/3. Thus, d’(x, y) > 1/3, and hence D.13 is 
contained in Dy. ;-(x)/2- oO 


We say that a Riemannian metric on a manifold M” is bounded if the distance 
between any two points of M” in this metric is smaller than some constant. 


Theorem 5.9 (Nomizu—Ozeki [No]) Any Riemannian metric g on a manifold M" 
is conformally equivalent to a bounded Riemannian metric g’. 


Proof In view of Theorem 5.8, we can assume that the metric g is geodesically 
complete. Let us fix a point x9 € M” and consider the function d(x, xo). This 
function is continuous. Take a smooth function w(x) for which w(x) > d(x, xo). 
Let us show that the Riemannian metric g’ = e~7®) g is bounded. 

The metric g is geodesically complete, and hence, for every point x € M”, there 
exists a geodesic y which joins the points x and xo and has length d(x, x9) = L. 
We can assume that y (0) = xo and the parameterization s of the curve y is natural. 
Any part of y is a shortest geodesic, so that d(y(s), x9) = s forall s € [0, L]. 

The length of the velocity vector ae in the metric g’ equals e~°” )), Therefore, 


the length of y in the metric g’ equals L’ = i e °Y ds, Since w(y(s)) > 
d(y(s), xo) = S, it follows that 


L lee) 
L' < / eds < / e *ds =1. 
0 0 


Thus, d’(x, x9) < 1 for any point x €¢ M”. Oo 
0 yp 
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Problem 5.7 Prove that if any Riemannian metric on a manifold M" is geodesically 
complete, then M” is compact. 


5.6 Covariant Differentiation of Tensors 


Specifying a connection on a manifold is equivalent to specifying a covariant 
derivative VyY for each pair of vector fields X and Y. In considering arbitrary 
connections, we already mentioned that, using the covariant differentiation of vector 
fields, we can define the covariant differentiation of covector fields (differential 1- 
forms). It is defined so that 


dx (@(Y)) = (Vxa)(Y) + @(VxY). 
Thus, the definition of the covariant derivative Vyw is as follows: 
(Vx@)(Y) = dx(w(Y)) — o(VxY). 


The covariant derivative of a function is the derivative of this function in the 
direction of a vector field X: Vy f = Oxf. 
In local coordinates covariant derivatives in the directions of the vector fields 2 


are denoted by V;. The covariant derivative of a vector field with coordinates V/ 
equals 
Laan 
(V;V)/ = Oxt + V Vip. 


Let us calculate the covariant derivative of a covector field in local coordinates: 


(Viw) (ej) = 0) (@(ej)) — o(Vie;) 


Thus, the covariant derivative of a covector field with coordinates w; equals 


do; k 
(Via); = aut Ol ;- 


In particular, Vidx* = Ti dx, 
The covariant derivative of a tensor field of any type is defined by Leibniz’ rule: 


Vx(T @ S) = (VxT) ®S+T ® (VxS), 
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where T and S are any tensor fields. By definition, the covariant derivative is also 
linear: if T and S are tensor fields of the same type, then 


Vx(T + S) = VxT + VxS. 


For example, let us calculate the covariant derivative of a tensor field Ai of type 


(1, 1): 


aAi 
Vi(Aje; @ dx*) = —fe; @ dx’ + ALVi(e;) @ dx* + Aye; ® Vi(dx*) 


Ox 

dAi, k ipm k i pk m 
= oxl ®dx + A,Tj; em @dx = Ay Tin ei ® dx 

OA At 
= k m i 
=. —te ® dx* + A; rie; @ dx* —A i Tire; @ dx*. 

Thus, 
_ @Al 
VIAL = or + ATT), — Ali. 


A similar calculation shows that the covariant derivatives of tensor fields of types 
(2, 0) and (0, 2) are 


ik 
ik _ dA! mk pi impk 
V/A‘ = ar Pim + A Vins 
ox 
0 Aik 
Vi Aik = ; _ Am} _ AimV ie. 
Ox 
The covariant derivative of a tensor field of type (p, g) with components T. & a is 
is a tensor of type (p, g + 1) whose components are calculated by the formula 
iji2...ip Pp 
iigip Jij2-Jq pitckotpy. iji2...Up 
MEG le ~ ay ate ya: i ja-dq 2 Pig Vvin : 6.6) 
k=1 


where the [° ‘ are the Christoffel symbols. 


Theorem 5.10 (Ricci’s Lemma) The covariant derivatives of the tensors gi; and 
g'! vanish, i.e., these tensor fields are parallel. 


Proof First, let us check that V; gj = 0 for alli, k,/. By definition, 


Vigik = — &mkV 7, — 8imV i: 


Oik 
ax! 
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According to formula (5.2) on p. 170, we have 


1 (Ogik | Og Ogil 1 (dOgix | Ogi  OgK 
Bs Alors ————— — = 7 7 Vr” = = —— — — —— a 
i aad) ( ax; | Ox; OxE Gime a \ ou” Bx, OR 


Therefore, 
O8ik 
SmkV i; + Simp = “‘Bxl? 


as required. 
Now the relation V;g'* = 0 follows from 


0 = Vi(gieg’) = (Vigixg + gikVigi« = girVig'. 


im] 


Theorem 5.11 (second Bianchi Identity) The covariant derivatives of the Rie- 
mann curvature tensor satisfy the identity 
h h h 
Vi Rijn + ViRing + VeRij = 9. 
Proof The proof of the second Bianchi identity is similar to that of the first Bianchi 
identity for any symmetric connection (see Theorem 4.6 on p. 162). Choose local 
coordinates in a neighborhood of a given point x so that all Christoffel symbols at 


this point vanish. Then the covariant derivative at this point is merely the partial 
derivative. For the curvature tensor we have obtained the expression (see p. 162) 


i 


or.. 
—s i Jt Prl Pri 
Rijx = aa Oxk + ie gp - PT pe 


2ph 2pA 
VR. = ae _ ae 
Uk axlaxi — axtaxk 
Indeed, calculating the derivative of FE rv p> we obtain the sum of two terms each 


of which contains, as a factor, a Christoffel symbol, which is zero. 
A similar calculation shows that 


2ph 2ph 2ph 27h 
h arly = aT; VR), = aT = aT; 
axiaxk = axJax!’ Hi axkax! = axkaxi’ 


Vi Rig = 


Summing the three expressions for covariant derivatives, we obtain the required 
identity. oO 
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In coordinate-free form, the second Bianchi identity is written as 
(VwR)(X, Y)Z + (VxR)(Y, W)Z + (Vy R)(W, X)Z = 0. 
Indeed, if X = X/e;, ¥Y = Y*ex, Z = Z'e;, and W = W'e;, then 
(VwR)(X, Y)Z = X/Y*zi W'(VRT en. 


Remark A different formulation of the second Bianchi identity is given in Theo- 
rem 4.7 on p. 166. We will not use or prove the equivalence of these formulations 
for the Levi-Civita connection. 


5.7 Sectional Curvature 


The Riemann tensor is a complicated object with four indices. But it can be 
described in terms of a simpler object, sectional curvature. The sectional curvature 
assigns a number defined by using the Riemann tensor to each two-dimensional 
plane in the tangent space. The Riemann tensor is completely determined by the 
sectional curvature. Sectional curvature is closely related to Gaussian curvature. In 
particular, for a surface, the tangent space is two-dimensional, and the sectional 
curvature assigned to this two-dimensional space is equal to the Gaussian curvature. 
Recall that, for a surface, the nonzero components of the Riemann tensor equal, up 
to sign, the Gaussian curvature. 

We set R(X, Y, Z, W) = (R(X, Y)Z, W). Then R(X/e;, Y*ex, Z'e;, Wie) = 
(R(X e;, Y*ex)Z'e;), W'ey) = (Ri, X! ¥*Z'e,, W'e) = gpl R}..X! y*ziw! = 
Rij X /¥*Z' Ww’. Consider the plane I generated by vectors X and Y in the space 
T, M". Let us check that the number 


R(X, Y, Y, X) 


a (X, XY, Y) — (X,Y? 


depends only on the plane I. 


1 y2 
Let P = (7, _ be the matrix of the coordinates of the vectors X and Y in 


some orthonormal basis of the plane IT. Then k(X, Y) = (X, X)(Y, Y) — (X, Y)? is 
the determinant of the matrix P P? . Therefore, k(X’, Y’) = (det A)?k(X, Y), where 
A is the transition matrix from the basis X, Y to the basis X’, Y’. 

Consider the restriction of the polylinear function R(X, Y, Z, W) to the plane 
II. From the symmetry properties of the curvature tensor it follows, in particular, 
that this function is skew-symmetric with respect to the first two and the last two 
arguments. Any bilinear skew-symmetric function B(X, Y) = bj; X ‘Y/ in the plane 
has the form c(X!Y* — X?Y'), where c is a constant. Indeed, since B(X, X) = 0, 
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we have bj; = bo2 = O and bj2 + bo, = O. Let us first fix Z and W. Then we obtain 
R(X, Y, Z, W) = B(Z, W)(X'Y2 — X?Y!), where B(Z, W) isa skew-symmetric 
bilinear function, ie., B(Z,W) = c(Z'W? — Z?W'). Thus, R(X, Y,Y,X) = 
—c(X!y* — X?y!)*. Therefore, R(X’, Y’, Y’, X’) = (det A)? R(X, Y, Y, X), where 
A is the transition matrix from the basis X, Y to the basis X’, Y’. As a result, we see 
that, for any pair of linearly independent vectors X, Y in the plane IT, the ratio of 
R(X, Y, Y, X) to (X, X)(Y, Y) — (X, Y)? is the same, as required. 

The number Ky is called the sectional curvature corresponding to the plane I. 
If the basis X, Y is orthonormal, then Ky = R(X, Y, Y, X). In particular, forn = 2, 
the sectional curvature at a point p is the Gaussian curvature at p. Indeed, if X = e, 
and Y = eo, then j =/ = | andk =i = 2, whence R(X, Y, Y, X) = Ri212 = K. 
In the general case, the sectional curvature at a point p is the Gaussian curvature at 
p of the image of the plane II under the map exp, (see Theorem 5.18). 


Theorem 5.12 The sectional curvature completely determines the curvature tensor. 
Proof First, we show that R(X, Y)Z can be represented in terms of expressions of 
the form R(X, Y)Y. To this end, we sum the three equations 
R(X, Y+Z)V + Z) = R(X, Y)Y + R(X, Y)Z 
+ R(X, Z)Y + R(X, Z)Z, 
—R(Y,X + Z)\(X+ Z) = —-R(Y, X)X+ R(X, Y)Z 
+ R(Z,Y)X — R(Y, Z)Z, 
0= R(X, Y)Z+ R(Y, X)Z. 
The sum of the next-to-last terms on the right-hand sides vanishes by virtue of the 
first Bianchi identity: R(X, Z)Y + R(Z, Y)X + R(X, Y)Z = 0. As a result, we 
obtain 
3R(X, Y)Z = R(X, Y + Z)\(Y + Z)— R(Y, X + Z)\(X + Z) 
— R(X, Y)Y — R(X, Z)Z + RY, X)X 4+ RY, Z)Z. 
Now let us show that (R(X, Y)Y, Z) can be represented in terms of expressions 


of the form (R(X, Y)Y, X). It follows from the symmetry properties of the Riemann 
tensor that 


(R(X, Y)Y, Z) = (RY, Z)X, Y) = (R(Z, Y)Y, X). 


Therefore, for fixed Y, the bilinear form B(X, Z) = (R(X, Y)Y, Z) is symmetric. 
Thus, we can use the formula 


OB(X, Z) = BUX 2, X47) — BX XY BZD), 
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which leads to the required representation. For example, the doubled first term in 
the expression for 3R(X, Y)Z is represented as 


2(R(X, Y+ Z)\(Y¥ + Z), W) = (R(X + W,Y 4+ Z)(¥ + Z),X+ W) 
— (R(X, Y4+ Z)\V¥ + Z), X) — (RWW, Y + Z)(V + Z), W). 


Problem 5.8 Show that the second derivative with respect to w and § of 
R(X +aZ,Y+BW,X+aZ,Y+BW)—R(X+aw,Y+BZ,X+aw,Y+ BZ) 


at the point a = 0, 6B = 0 equals 6R(X, Y, Z, W). Prove Theorem 5.12 by using 
this fact. 


A Riemannian manifold M” is called a space of constant curvature if its sectional 
curvature is constant, i.e., depends neither on a point x € M” nor on a plane 
in T,M”. The following theorem shows that, for n > 3, the former condition is 
redundant: the independence of the sectional curvature of a point follows from its 
independence of a plane at each point. 


Theorem 5.13 (F. Schur) A connected Riemannian manifold of dimension n > 3 
whose sectional curvature is constant (does not depend on the choice of a plane 11) 
at each point is a space of constant curvature. 


Proof Suppose that the sectional curvature at a point p equals K for any plane IT. 
First, we prove that, in this case, we have R(X, Y,Z,W) = K - Ri (X,Y, Z, W) 
at this point, where R1(X, Y,Z,W) = (Y, Z)(X, W) — (X, Z)(Y, W). We set 
k(X,Y) = R(X, Y,Y, X) and ki(X,Y) = R(X, Y,Y,X) = (X, X)(Y,Y) - 
(X,Y). By assumption, k(X,Y) = K - k\(X,Y) at the point p. In the proof 
of Theorem 5.12 we represented R(X, Y, Z, W) as the sum of terms of the form 
R(X, Y, Y, X) = k(X, Y): 


OR(X, Y,Z,W) =2R(X,Y+Z,Y4+Z,W)+... 
=R(X+W,Y4+Z,Y4+Z,X4+W)+... 
=k(X+W,Y+Z)+... 

In view of this representation, the equation k(X,Y) = K - k\(X,Y) implies 
R(X, Y,Z,W) = K - R(X, Y, Z, W). 
According to Ricci’s lemma, the tensor field g;; is parallel; therefore, the tensor 


field R, with components gjxgj1 — 8ij xi 18 parallel as well: Vy Ri = O for any U. 
Thus, 


(Vu R)(X, Y, Z, W) = du K - Ri (X,Y, Z, W). 
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This means that, for any X, Y, and Z, we have 
(Vy R)(X, Y)Z = dy K - ((Y, Z)X — (X, Z)Y). 


We write similar equations for (Vx R)(Y, U)Z and (Vy R)(U, X)Z and sum the 
three equations, using the second Bianchi identity 


(VwR)(X, Y)Z + (Vx R)(Y, W)Z + (Vy R)(W, X)Z = 0 
(see p. 183). As a result, we obtain 


0 = dK - ((Y, Z)X — (X, Z)Y) + oxK - (U, Z)Y — (Y, Z)U) 
+ dy K - ((X, Z)U — UU, Z)X). 


Let X be an arbitrary vector, and let vectors Y, Z, and U be chosen so that the 
vectors X, Y, and Z are pairwise orthogonal (this can be done because n > 3), 
U = Z, and ||Z|| = 1. Then (Y, Z) = (X, Z) = 0 and (U, Z) = 1, which implies 


dxyK -Y—odyK-X =0. 


Since the vectors X and Y are linearly independent, it follows that dy K = 0 for any 
vector X. Therefore, K is a constant. oO 


HISTORICAL COMMENT The theorem that the constancy of the sectional curvature 
in all directions implies its constancy at all points was proved by Friedrich Schur 
(1856-1932) in 1886. 


5.8 Ricci Tensor 


The Riemann tensor is too complicated. It is easier to study it by considering some 
of its components, such as the sectional curvature. The Ricci tensor is, as well as the 
sectional curvature, a component of the Riemann tensor. As opposed to the sectional 
curvature, it does not completely determine the Riemann tensor. Nevertheless, the 
Ricci tensor reflects some important properties of Riemannian manifolds. 

The Ricci tensor assigns the trace of the map Z +» R(Z, Y)X to each pair of 
vector fields X and Y: 


Ric(X, Y) = tr(Z bh R(Z, Y)X). 
In local coordinates, we obtain the map 


Zhe; > R(Zie;, Y*ex) Xe; = (RE LY*X') Zep. 
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The trace of the map Z/e ji Ay Zie p equals Ay (the summation is with respect 


to j); therefore, Ric(X, Y) = Rix! yk = Rig X'Y/. Thus, in local coordinates 
Ric = Rijdx! @ dx/, where Rij = Rigi: 

If the basis e], ..., @, is orthonormal, then Rig = Rrjx; hence Rij = 
> x=1 Reikj in an orthonormal basis. The symmetry properties of the Riemann 
tensor imply Rj; = Rjj, i.e., the Ricci tensor is symmetric. Therefore, the Ricci 
tensor is completely determined by the values R(X, X), where X ranges over all 
unit vectors. 


Problem 5.9 Let e1, ..., e€, be an orthonormal basis. Prove that Ric(e;, en) = 
Sm K (ej, €n), where K (e;, én) is the sectional curvature corresponding to the 
plane spanned by the vectors e; and ép. 

The scalar curvature S is the trace of the Ricci tensor with respect to the 


Riemannian metric: § = g'/ R;;. 


Problem 5.10 Let e,, ..., @, be an orthonormal basis. Prove that S$ = 
a oe ee K (ej, ex), where K (e;, ex) is the sectional curvature corresponding to 
the plane spanned by the vectors e; and ex. 


HISTORICAL COMMENT Ricci-Curbastro introduced the Ricci tensor in 1903. 


5.9 Riemannian Submanifolds 


Let M be a submanifold of a Riemannian manifold M. It is equipped with the 
induced Riemannian metric: the inner product of tangent vectors to M equals the 
inner product of these vectors considered as tangent vectors to M. As a result, we 
obtain two Levi-Civita connections: V on the Riemannian submanifold M and V 
on M. 

The tangent space T,M to M ata point p € M can be represented as the direct 
sum of the tangent space T,M and the normal space N,M. Therefore, for each 
tangent vector X in the space T)M , we can consider its projections X’ on the 
tangent space T,M and X + on the normal space N pM. 

Let X and Y be vector fields on M. We can extend them to vector fields on M i 
take the covariant derivative Vx Y, and represent it in the form (VxY)? + (VxY)t. 
We will see shortly that the tangential component (VyY)! is VyY. The normal 
component (Vx Y)+ is, by definition, the second fundamental form II(X, Y). With 
the second fundamental form we can associate the second quadratic form II(X, X). 
For hypersurfaces, we identify the normal space with numbers, and therefore the 
second quadratic form takes numerical values. In the general case, the second 
quadratic form takes values in a linear space of dimension equal to the difference of 
the dimensions of the manifolds M and M. 
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The symmetry of the Levi-Civita connection implies that of the second funda- 
mental form: 


TX, Y) — W(Y, X) = (VxY)t — (VyX)t = [X, y+ =0, 


because the commutator of vector fields on a manifold M is a vector field on M. 


Theorem 5.14 (Gauss’ Formula) /f vector fields X and Y on M are extended to 
M, then 


VxY¥ =VxY+I(X, Y). 


Proof We must prove that the tangential component (Vx Y)" equals Vx Y. Consider 
the map V’: TpM x Ty)M — TyM defined by viy = (VxY)". Let us show that 
Vv! = V. In view of the uniqueness of the Levi-Civita connection, it suffices to 
check that V" is a symmetric connection compatible with the metric. 

First, we check that V" is a connection: 


(Vex)? = f(VxY)? = fVEY, 
(Vx fY)" = (ax fyY? + f(VxY)? 


= (dx f)Y¥" + fVXY. 


Then we check the symmetry of V7: 


ViY — VEX = (VxY —Vyx)? 


= [X, Y]" = [X, YI. 
Finally, we check that the connection V’ is compatible with the metric: 


dx (Y, Z) = (VxY, Z) + (Y, VxZ) 
= ((VxY)", Z) + (Y, (VxZ)*) 
= (VxY, Z) + (Y, VxZ). 


| 


For a surface in R*, the second quadratic form makes it possible to calculate the 
covariant derivative of a normal vector field (by the Weingarten formula); a similar 
formula also exists for hypersurfaces. In the general case, the following theorem 
holds. 


5.9 Riemannian Submanifolds 189 


Theorem 5.15 (Weingarten Formula) Let X and Y be tangent vector fields on a 
Riemannian submanifold M C M, and let N be anormal vector field. Suppose that 
they are extended to vector fields on M. Then 


(VxN, Y) = -(N, W(X, Y)). 
Proof Clearly, (NV, Y) = 0. Therefore, 


0 = dx(N, Y) = (VxN, Y) + (N, VxY) 
= (VxN,Y)+ (N, Vx¥ + W(X, Y)) = (VxN, Y) + (N, WX, Y)), 


because the vectors N and Vx Y are perpendicular. Oo 


The second fundamental form also makes it possible to calculate the difference 
between the curvature tensors on the manifolds M and M. We denote these tensors 
by R(X, Y, Z, W) and R(X, Y, Z, W). Here X, Y, Z, and W are vectors in T,M. 


Theorem 5.16 (Gauss Equation) The curvature tensors on a manifold and a 
submanifold are related by 


R(X, Y, Z, W) = R(X, Y, Z, W) — (II(X, W), HI(Y, Z)) + (IICX, Z), IW’, W)). 


Proof Let us extend vectors X, Y, Z, and W first to vector fields in a neighborhood 
of a point p in the manifold M and then to a neighborhood of this point in the 
manifold M. According to the Gauss formula, we have 


R(X, Y,Z,W)= (Vx VyZ - VyVxZ - VixyZ, W) 
= (Vx(VyZ + IY, Z)) — Vy (VxZ + W(X, Z)), W) 
— (Vrx,y]Z + T([X, Y], Z), W). 


The second fundamental form takes values in the normal space, and W is a 
tangent vector; therefore, (II([X, Y], Z), W) = 0. To the two other terms containing 
the second fundamental form the Weingarten formula applies: 


R(X, Y, Z, W) = (VxVyZ, W) — (II(Y, Z), IX, W)) 


— (Vy VxZ, W) + (IX, Z), HWY, W)) 
— (Vix, y1Z, W). 
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Now we decompose V into a normal and a tangential component. All inner 
products of the normal component and tangent vectors vanish, so that only the 
tangential component remains: 

R(X, Y, Z, W) = (VxVyZ, W) — (Vy VxZ, W) — (Vpx,viZ, W) 
— (L(Y, Z), W(X, W)) + UI(X, Z), IW’, W)) 
= R(X, Y, Z, W) — ((X, W), IY, Z)) + (I(X, Z), IY, W)). 


Oo 


HISTORICAL COMMENT In 1827 Gauss derived an equation expressing the differ- 
ence between the Riemann curvature tensor of a manifold and that of a submanifold 
in terms of the second quadratic form in the case of a surface in three-dimensional 


space. In this case, R (X, Y,Z,W) = 0 and the Gauss equation expresses 
R(X, Y, Z, W) in terms of the second quadratic form. 
Theorem 5.17 


(a) Let V be a vector field given along a curve y on a Riemannian submanifold 
M Cc M. Then Vy'V = Vy'V +1)’, V). 

(b) Let y be a geodesic on a Riemannian submanifold M Cc M. Then Vyy' = 
My’, y’). 

Proof 


(a) Consider a moving frame e; (i = 1, ..., dim™M) along the curve y on the 
manifold M. Let us expand the vector V(t) at a point y(t) in this frame: V(t) = 
V' (t)e;. We have 


Vy V = Vy View = (Vie + VV ye 
= (Vie + Vi Ve; + VM’, 6) = VyrV + IN’, V). 


(b) Let us apply the equation in (a) to a vector field V equal to the velocity of the 
curve y: 


Vy! = Vyy' +’, vy’). 


It remains to note that Vy’ = 0 for a geodesic. 
oO 


Now we are ready to revisit sectional curvature and prove its property which was 
stated but not proved. Consider a two-dimensional plane IT in the tangent space at 
a point p € M”. Take a neighborhood of zero in this tangent space, small enough 
for the map exp, to be a diffeomorphism in this neighborhood. Let St be the two- 
dimensional surface being the image under exp, of the part of IT contained in the 
chosen neighborhood. 
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Theorem 5.18 The sectional curvature Ky equals the Gaussian curvature K of the 
surface Sy at the point p. 


Proof First, we prove that the second fundamental form of the surface Sy regarded 
as a submanifold of M” vanishes at p. Given any vector V € T,Sr, consider a 
geodesic y = yy on M” for which y(0) = p and y’(0) = V. According to 
Theorem 5.17, we have 


0= Vyy' = Vyy’ +1’, y’). 


The vector V,’y’ lies in the tangent space, and II(y’, y’) lies in the normal space; 
therefore, each of them vanishes. Thus, II(V, V) = 0 for any vector V € T, Sn, 
and since the second fundamental form is symmetric, it follows that it vanishes at 
the point p. 

The Gauss equation shows that at the point p, for tangent vectors to the surface 
Sq, the Riemann tensor for the surface Sy coincides with that for the manifold M”. 
Therefore, the sectional curvature corresponding to the plane ITI equals the sectional 
curvature of the surface Sy, i.e., the Gaussian curvature of Sj at the point p. oO 


5.10 Totally Geodesic Submanifolds 


A Riemannian submanifold M C M is said to be totally geodesic at a point p © M 
if a sufficiently small part of any geodesic line on M passing through the point p is 
also geodesic on the manifold M.An example of a submanifold totally geodesic at a 
point p was considered in the preceding section: this is the two-dimensional surface 
St being the image of a neighborhood of zero in a plane IT under the map exp,. 


A Riemannian submanifold M C M is said to be totally geodesic if it is totally 
geodesic at each point. This means that any geodesic line on M is also geodesic 
on M. Examples of geodesic submanifolds are geodesic lines on a Riemannian 
manifold and the intersection of the sphere with a subspace through its center. 

In the proof of Theorem 5.18 we showed that, for the surface Sp, the second 
fundamental form at a point p vanishes. This property is characteristic of totally 
geodesics submanifolds. 


Theorem 5.19 A Riemannian submanifold M C M is totally geodesic if and only 
if its second fundamental form W(X, Y) is zero. 


Proof Let y be a geodesic on a Riemannian submanifold M Cc M. Then, according 
to Theorem 5.17, we have Vyy' = II(y’, y’). Therefore, any geodesic on the 
manifold M is also a geodesic on the manifold M if and only if the second quadratic 
form II(X, X) is zero. The second fundamental form II(X, Y) is symmetric, and 
hence it is zero if and only if so is the second quadratic form. oO 
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Theorem 5.20 Let y be acurve ona totally geodesic submanifold M C M. Then 


(a) the parallel transports of a tangent vector to M along y inside M and inside M 
coincide; 
(b) the transport along y of a vector orthogonal to M is orthogonal to M. 


Proof 


(a) Let V bea vector field on the manifold M parallel along the curve y, i.e., such 
that VyiV = 0. We must prove that V is also parallel on the manifold M. 
According to Theorem 5.17, we have VyV =VyV+ II(yv’, V) = 0, because 
the second fundamental form is zero for a totally geodesic manifold. 

(b) The parallel transport along y of a vector V orthogonal to M and a vector W 
tangent to M preserves the angle between these vectors, and it takes W to a 
tangent vector to M. Therefore, the transported vector V is orthogonal to all 
tangent vectors to M. 

oO 


Problem 5.11 Suppose that the fixed point set of an isometry o of a Riemannian 
manifold M is a connected submanifold of M. Prove that the submanifold M Cc M 
is totally geodesic. 


5.11 Jacobi Fields and Conjugate Points 


The first and second variation formulas and their proofs in the many-dimensional 
case are the same as in the two-dimensional case (see Sects. 3.18 and 3.19). The 
definition of Jacobi vector fields and conjugate points is precisely the same as well 
(see Sect. 3.20). However, in the many-dimensional case, it makes sense to consider 
the multiplicity of conjugate points. Recall that we have proved that, in the two- 
dimensional case, their multiplicity is always 1. The same argument shows that, in 
the n-dimensional case, the multiplicity of conjugate points is at most n—1. Namely, 
the dimension of the space of Jacobi fields vanishing at t = 0 equals n; moreover, 
there exists a Jacobi field which vanishes at tf = 0 and does not vanish at ¢ = 1 (an 
example of such a field is J(t) = tV(t), where V(t) = y’(t) is the velocity vector 
of a geodesic). 

Let us show that the multiplicity of a conjugate point on an n-dimensional 
Riemannian manifold may equal n— 1. Consider diametrically opposite points p and 
q on the sphere S”. Jacobi fields can be obtained by using variations in the class of 
geodesics with loose endpoints. Moreover, in the case where the endpoints are fixed, 
Jacobi fields vanishing at these endpoints are obtained. The one-parameter families 
of motions of the sphere S” which leave the points p and q fixed determine Jacobi 
fields vanishing at p and q. Such motions of the sphere S” correspond to motions 
of the equatorial sphere S’~!. The sphere S’~! can be rotated in n — 1 independent 
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directions; therefore, we obtain n — 1 independent Jacobi fields vanishing at the 
points p and q. 

The proof that a geodesic containing two conjugate points is not a shortest curve 
with given endpoints remains the same. 

The proof of the theorem about a relationship between conjugate points and 
critical points of the map exp needs no substantial changes: it suffices to take a 
basis consisting of n vectors X1,..., X» instead of a basis consisting of two vectors 
X1, X2. 

In the study of Jacobi fields, of most interest are those orthogonal to geodesics. 
The point is that, given any geodesic y, the tangent vector field (a+ bt)y’(t), where 
a and b are constants, is a Jacobi field. Such Jacobi fields are of little use, and it is 
better to exclude them from consideration. 


Theorem 5.21 [fa Jacobi field J is orthogonal to a geodesic y at two points, then 
it is orthogonal to y at all points. 


Proof Since Vy'y" = 0, it follows that Ls, Y= WAV) + (SG Vyyv’) = 
(V,/J,y’) and Li, yy = (Vols y’). Using the Jacobi equation Vid _ 


RO, Dy’ = 0, we obtain SU, y') = (RY. Dy) = RY. Ly’) = 0 
by virtue of the symmetry properties of the curvature tensor. Thus, (J(t), y’(t)) is 
a linear function of t. If it vanishes at two points, then it also vanishes at all other 
points. Oo 


For calculating Jacobi fields normal coordinates are convenient. Normal coordi- 
nates on a Riemannian manifold M” are defined in precisely the same way as for a 
surface. They are introduced in a sufficiently small neighborhood of a point p € M”. 
Normal coordinates correspond to the image of a Cartesian coordinate system in the 
tangent space at p under the map exp,. Let 1, ..., én be an orthonormal basis in 
T,M", and let g = exp, (x!e1 +++» +x"e,). Then (x!,...,2x”) are the normal 
coordinates of the point q. 

The differential of the map exp, at the origin is the identity map; therefore, in 
normal coordinates the first quadratic form at a point p is (dx!)? + --- + (dx")?, 
Le., 9; = 4;;, and the geodesics through p are given by the xt=te!,...,x" = te". 

Exactly the same argument as in the case of a surface proves that all Christoffel 
symbols at a point p vanish in normal coordinates (see the solution of Problem 3.28). 


Theorem 5.22 Let x!,..., x” be normal coordinates in a neighborhood of a point 
p, and let y(t) be a geodesic through p (y(O) = p). Then, for any vector W = 
Wid; € T,M", a Jacobi field J along y for which V,,J(O) = W is given by 
J(t) = tW' 90; in normal coordinates. 


Proof Consider the vector field on y(t) defined by J(t) = tW‘d; in normal 
coordinates. At the origin all Christoffel symbols are zero; therefore, V,/J(0) = 
(tW')'d; = W'd; = W. Thus, the vector field J satisfies the initial condition, and 
hence it suffices to check that J is a Jacobi field. Let us construct a variation with 
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variation vector field J in the class of geodesics. The geodesic y(t) is given by 
y(t) =(tV!,...,tV”), where V = y’(0). Consider the variation 


y(s,t) =(tV! +stw!,...,tV" + stW”). 


This is a variation in the class geodesics, so that ay(0, t) is a Jacobi field. Clearly, 
Ay(0,t) = (¢W!,...,1W") = J (0). o 


Using Theorem 5.22, we can obtain the following expression for the metric in 
terms of the tensor curvature in normal coordinates. 


Theorem 5.23 Ler x!, ..., x” be the normal coordinates of a point x. Then, in 
normal coordinates, 


i ae 1 ve pxge ell 2 
Bij (x) = bij + 3 Rimix x + o((|x||*). 


Proof Let p be the origin. To each point x we assign the vector X with the same 
coordinates and consider the geodesic y(t) = exp (tX). Consider also the Jacobi 
fields Jj (i = 1, ..., m) along this geodesic for which J;(0) = 0 and (V,,/J;)(O) = 
0;. These Jacobi fields have the form J;(t) = td;. We are interested in the inner 
product (0;, 0;) at the point y(1). We begin with calculating the derivative of the 
inner product (J;(t), J; (¢)) with respect to f. 


df (y(t)) 
dt 


Clearly, = 0, f, whence 


d 
a ted) = Wy Gad) = Vy tdy) + Gp Vy): 


For brevity, we will denote the derivative with respect to tf and the covariant 
derivative V,, by a prime. The calculation of derivatives can easily be continued: 
(Ji. Jj)" = I) + 20}, I) + i, I): 
(Hi Jy)” = (HT) + 30, FE) + 3G, FD) + is 2; 
(Ji, een = Ce J) +40", Ji) + 6(J;", Jj) +45, Ji) + (Ji, eb 


We are interested in the values of these derivatives at t = 0. For t = 0, the Jacobi 
equation has the form (Vx)? J; = R(X, J;)X. By assumption, J;(0) = 0, whence 
J!'(0) = 0. Thus, for t = 0, all terms containing J or J” vanish. Therefore, the first 
and third derivatives are zero: 


(Ji, Jj)" (0) = 205}, Jj) O) = 2(;, 9)), 
(Ji, Jj)" O) = 4057" (0), 87) + 4G, J” 0). 
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To calculate J’”’(0), we use the Jacobi equation J” = R(X, J)X. As a result, we 
obtain 


PS ROG DEERE NER IIE RI, 


Taking into account the relations /(0) = 0, X’ = 0, and J! (0) = 0;, we see that 
J"(0) = R(X, J/(0))X = R(X, 0;)X, and hence 


(J1""(0), 83) = (R(X, 4X, Bf) = (8), JNO) = Rjaix*x!, 


It is required to calculate (0;, 0;) at the point y(1). At this point 0; = Jj; thus, 
we must calculate (Jj, J;)(1). We have 


1 1 
Fis IO = Gi, AYO? +s Rjeix xl $= GO + y RKO +o 


For t = 1, we obtain the required relation. oO 


Theorem 5.22 gives an expression for a Jacobi field on any Riemannian manifold. 
In the case of a space of constant curvature, we can obtain another explicit 
expression for a Jacobi field orthogonal to a geodesic. 


Theorem 5.24 Let M” be a Riemannian manifold of constant sectional curvature 
c, and let y be a geodesic with natural parameterization. Then a Jacobi field along 
y orthogonal to y and vanishing at t = 0 has the form J(t) = u(t)n(t), where n is 
a field of vectors orthogonal to y which is parallel along y and the function u(t) has 
one of the following forms (depending on the sign of c): t (for c = 0), Rsin(t/R) 
(for c = Zz > 0), and R sinh(t/R) (for c = —Zy < 0). 


Proof Recall that the Jacobi equation for a Jacobi field J along a geodesic y has 
the form (Vy) J — R(V,J)V = 0, where V = y’ is the velocity vector of 
the geodesic. In the proof of Schur’s theorem (Theorem 5.13) we showed that, 
for a space of constant sectional curvature c, the curvature tensor has the form 
R(X, Y)Z = c((Y, Z)X — (X, Z)Y). Substituting this expression into the Jacobi 
equation, we obtain 


0 = (Vy)? J —c((J, V)V — (V, V) J) 
=(Vy)J+cJ, 


because the vectors J and V are orthogonal and ||V || = 1. 

Let us find out what fields of the form J(t) = u(t)n(t), where u(t) is a function 
and n(ft) is a field parallel along y and orthogonal to y, are solutions of the obtained 
equation. Clearly, (Vy)?(u(t)n(t)) = u(t)n(t), because the vector field n(t) is 
parallel along y, i.e., Vyn = 0. Thus, we arrive at the equation vu” (t) + cu(t) = 0 
with initial condition u(0) = 0. Up to proportionality the solutions of this equation 
coincide with those in the statement of the theorem. 
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The dimension of the space of vector fields parallel along y and orthogonal to 
y equals n — 1. The dimension of the space of Jacobi fields orthogonal to y and 
vanishing at t = 0 is at most n — 1. Therefore, in fact, the particular solutions found 
above exhaust all solutions. Oo 


Employing both expressions for a Jacobi field, we can describe the structure of 
the metric of a space of constant curvature in a sufficiently small neighborhood and, 
thereby, prove that a space of constant curvature is locally isomorphic to the sphere, 
Euclidean space, or hyperbolic space. 


Theorem 5.25 Let M” be a Riemannian manifold of constant sectional curvature 
c, and let x',..., x" be normal coordinates in a neighborhood of a point p. Suppose 
that a point q of M" lies ona sphere (x!)*+---+(x")? = r? anda tangent vector V 
at q is represented in the form V = V+ +. V", where V+ is a vector perpendicular 
to the sphere (with respect to the normal coordinates) and V" is a vector tangent 
to it. Then the metric g of the Riemannian manifold is expressed in terms of the 


Euclidean norm || - || in normal coordinates as follows: 
IVF + IVF = IVP ifc =0, 
27. i 
gV.V)= FVII? +4 (sin? 2) IVI? ifc=p>0, 
ome : : 
V+? + & (sink? &) V7 Fife =z < 0. 


Proof Let || Vile = g(V, V). According to Gauss’ lemma, the vectors V+ and V7 
are orthogonal with respect to the metric g, so that || Vile = IV—lle + |v" Iz. The 
length of the velocity vector of a geodesic from the point p is the same in both 
metrics; therefore, || V+ Fe = ||V+||*. It remains to calculate || V7 lle. 

Let y be a radial geodesic with natural parameterization joining the points p = 
y (0) andg = y(r), and let W = Vv’. According to Theorem 5.22, a Jacobi field J 
along y vanishing at p and equal to W at g is given by J(t) = twi 0;. Since the 
Jacobi field J is orthogonal to the geodesic y at the points p and q, it follows that it 
is also orthogonal to it at all other points (see Theorem 5.21). 

Now we apply Theorem 5.24 and represent a Jacobi field J on a Riemannian 
manifold of constant sectional curvature c which is orthogonal to y and vanishes at 
t = 0 in the form J(t) = u(t)n(t), where n is a field of vectors orthogonal to y 
which is parallel along y and u(t) is a function of one of the three forms specified in 
the statement of the theorem. Then V,, J (0) = u’/(0)n(0) = n(0), because u'(0) = 1 
in all of the three cases. Since the field n(t) is parallel, it follows that the length of 
the vector n(t) is constant and equals the length of n(0). Therefore, 


IWie = OIg = lu )nOlle = le@/? - Vp FOI. 
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Next, we have V,J(0) = wi 0; € TpM". At the point p both metrics coincide, so 
that 


Sea 1 
IM FOlle = —NW'dille = — NWI. 


2 
Pi? ur) Sof Reale 
|V° ||", where —+ = 1 fore = 0, —+ = Ssin% for 


Thus, ||V7|2 = 


u@) _ R 
rr 


u(r) 
= 


c > 0, and sinh & for c < 0. oO 


Corollary Riemannian manifolds M and M' of dimension n which have the same 
constant sectional curvature are locally isometric, i.e., any points p € M and p' € 
M' have isometric neighborhoods. 


Proof Given points p and p’, consider geodesic balls of sufficiently small radius e. 
Using normal coordinates in these balls, we can map one of the balls onto the other, 
and in normal coordinates the Riemannian metrics on these balls are the same. O 


A Riemannian manifold locally isometric to Euclidean space is said to be flat. A 
manifold is flat if and only if one of the following conditions holds: (1) all sectional 
curvatures are zero; (2) the curvature tensor is zero. Indeed, the curvature tensor of 
a flat manifold is the same as that of Euclidean space, i.e., zero. If the curvature 
tensor is zero, then so are all sectional curvatures, and therefore the manifold is 
locally isometric to Euclidean space. 


5.12 Product of Riemannian Manifolds 


Consider a manifold M = M, x Mp) which is the direct product of manifolds M, 
and M>. Let X; and X2 be vector fields on Mj and on M2. They can be assigned the 
vector field X; + X2 on M defined by 


(X1 + X2)(p,q) = (X1)p @ (X2)q- 


To each vector X(p,q) € T(p,q)M we can assign vectors (X1)p» € T,M, and 
(X2)q € TqMz so that X(pq) = (X1 + X2)(p,q). Therefore, given any vector field 
X on M, at each point (p, q) we can write the expression X(p.q) = (X1 + X2)(p,q), 
where X; and X2 are some vectors at the points p and q. But a vector field X can 
be represented in the form X = X; + X2 only in the case where (X1)p € T,)M, in 
this expression does not depend on g and (X2)q € T Mz does not depend on p. 

The product of Riemannian manifolds Mj and M2 is the manifold M = M, x 
Mp? with Riemannian metric equal to the sum of the Riemannian metrics of the 
manifolds M; and Mp. In other words, for X;, Y; « TM;, 


gm(X1 + X2, ¥1 + Y2) = gm, (X1, V1) + guy (X2, ¥2). 
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It suffices to define inner product in each tangent space. For arbitrary vector fields 
X and Y on M, gm (X, Y) is calculated at each point separately. 

Clearly, the tangent spaces to the manifolds M; and M2 are orthogonal in the 
tangent space to the manifold M. 

In the rest of this section, we assume that X;, Y;, Z;, Wi € T Mj. 

Let us show that, for any connections V, V!, and V2 on the Riemannian 
manifolds M, M,, and Mo, we have 


Vx,4%(%1 + Yo) = Vx" + VX, Yo. (5.7) 


First, we check that Vx, Y2 = 0 and Vy, Y; = 0. We prove only the first relation 
(the proof of the second is similar). We apply Koszul’s formula 


2(VxY, Z) = dx(Y, Z) + dy(X, Z) — 02(X, Y) 


Note beforehand that [X1, Y2] = 0. Here X; and Y2 are regarded as vector fields 
on M; the coordinates of the vector X; corresponding to Mp? are zero, and its 
coordinates corresponding to M, do not depend on those coordinates of the point 
which correspond to M2. We check that [X1, Y2] = 0 coordinatewise. Namely, 
we assume that X; = fe; and Y2 = ge;, where f does not depend on the jth 
coordinate and g does not depend on the ith one. Then 


[fe gejl = fgle,ej]+ fge; — g(0; foei = 9, 


because each of the three terms is zero. 

It suffices to prove that (5.7) holds at each point. Thus, we can assume that a 
vector field Z has the form Z = Z, + Z> (we can represent any vector Z in this 
form at a given point and then extend this vector to a vector field constant in each 
of the two components). Let us write Koszul’s formula for X = X1, Y = Yo, and 
Z = Z, + Zo. Clearly, we have (X1, Y2) = 0 and [X), Y2] = 0. Therefore, 


2(Vx,Y2, Z) = dx, (Y2, Z) + dy,(X1, Z) — ([X1, Z], Y2) — ([¥2, Z], X1) 
= dx, (V2, Z2) + dy, (X1, Z1) — ((X1, 21], Y2) — (2, Z2], X1). 


All terms in this expression are zero. For example, dx,(Y2, Z2) = 0 because the 
coordinates of the vectors Y2 and Z2 do not depend on the coordinates of the point 
in the first component, and ([X1, Z1], Y2) = 0 because the vectors [X1, Z;] and Y2 
are orthogonal. 

Let us check that Vx, Y) = VE, Y (the proof of the equation Vx, Y2 = Vi, Y2 is 
similar). Again applying Koszul’s formula, we see that 


(Vx, %1, Z,+Z2)= (Vx,%1, Z1) = (Vx,N1, Z1) = (Vx, "%1, Z1 + Z2). 
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Theorem 5.26 R(X; + X2, Y1 + Yo, Z1 + Z2, Wi + Wo) = R(X, M1, Z1, Wi) + 
R(X2, Yo, Z2, W2). 


Proof We use formula (5.7), the definition 


R(X, Y)Z = VxVyZ — Vy VxZ — Vix, yjZ 
of the curvature tensor, and the fact that [X,, Yo] = 0. As a result, we obtain 
R(X) + X2, V1 + Y2)(Zi + Z2) = R(X, V1)Z1 + R(X2, Y2)Zo. 
Therefore, 


R(X, + X02, V1 + Y2, Z1 + Zo, Wi + W2) 
= (R(X) + X2, Y1 + Y2)(Z1 + Z2), Wi + Wr) 
= R(X1,%1, Z1, Wi) + R(X2, Yo, Z2, W2). 


Oo 


In particular, the sectional curvature of the plane spanned by the vectors X; and 
Y> is zero. 


Problem 5.12 Prove that M; and M2 are totally geodesic submanifolds in M, x M2. 


Problem 5.13 Prove that a curve on a Riemannian product of manifolds is geodesic 
if and only if its projection on each factor is geodesic. 


5.13. Holonomy 


The parallel transport of a vector along a piecewise smooth curve y(t), t € [a, b], 
is defined in a natural way. Let y(t) be smooth everywhere except at points ty < 
t2 <--+ < t,. Then we first transport the vector from the point y (a) to y(t) along 
a smooth curve, then from y (t1) to y(t2), and so on. 

The parallel transport along a piecewise smooth Joop, i.e., a curve which begins 
and ends at the same point p, defines a self-isometry of the space T,, M". Given two 
loops, we can compose a single loop which goes first along one of the loops and then 
along the other. (It is for this purpose that we consider piecewise smooth rather than 
smooth loops: a loop composed of two smooth loops is only piecewise smooth.) 
Thus, the isometries under consideration form a group Hol), which is a subgroup of 
O(n); the inverse element corresponds to the loop traversed in the reverse direction. 
This group is called the holonomy group. Considering only contractible loops, we 
obtain the so-called restricted holonomy group Hol? 

The holonomy groups Hol, and Hol, of a connected manifold at different points 
p and q are isomorphic. The isomorphism can be constructed as follows. Choose 
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any smooth path @ with initial point p and terminal point g. To a loop y with initial 
and terminal point p we assign the loop @ = a !ya with initial and terminal 
point g (first, is goes along a~!, then along y, and finally along a). This map is 
a group homomorphism, because the parallel transport along the paths a~! y,y2a 
and a~!y,aa~!y2a@ yields the same result. Indeed, the transport along aa~! is the 
identity transformation, because as the path a is traversed in the reverse direction, 
the vector repeats its motion backwards. The homomorphism Hol, — Hol, defined 
by w + awa™! is inverse to the homomorphism Hol, — Hol,. The restricted 
holonomy groups Hol? and Hol? are isomorphic as well; the proof is exactly the 
same. In what follows, we usually do not specify the point at which the holonomy 
group is calculated. 

A connected manifold is orientable if and only if its holonomy group is contained 
in SO(n). Indeed, given any path, we can define the parallel transport of the 
tangent space orientation along this path. A manifold is orientable if and only if 
the orientation is preserved by the parallel transport along any loop. On the other 
hand, the holonomy group is contained in SO(n) if and only if the orientation is 
preserved by the transport along any loop. 


Problem 5.14 Prove that Hol(S”) = SO(2). 
Problem 5.15 


(a) Prove that Hol(S*) = SO(3). 
(b) Prove that Hol(S”) = SO(n). 


Problem 5.16 


(a) Prove that the holonomy group of a cone with a flat metric is the cyclic group 
whose generator is the rotation throught the angle at the vertex of the cone net. 
(We consider a cone without vertex in order that the surface be smooth.) 

(b) Prove that the holonomy group of the Mébius band with flat metric consists of 
the identity map and the map Vr —V. 


Problem 5.17 Prove that the holonomy group of a product of Riemannian mani- 
folds is the product of holonomy groups. 


5.14 Commutator and Curvature 


Curvature can be interpreted as infinitesimal holonomy. We begin with a similar 
interpretation in the simpler case of a commutator of vector fields. Our exposition 
follows [Fal]. 

Consider vector fields X and Y. We can locally define an integral curve 
(trajectory) of the vector field X, that is, a curve y for which the velocity vector 
at a point y(t) equals X (y(t)). Using trajectories, we can define the map X;, which 
equals the displacement along a trajectory during time ¢ (this map is not defined if 
the trajectory cannot be extended to f). 
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Fig. 5.2. The map o 


Pi (0) P» 


For a function f(y (t)) ona trajectory y, the Taylor series has the form 


n 


k 
fXp)— f(D) = To" fp) +00), 


k=1 


2 k 
Indeed, we have af = x? or = dx f, whence “f = (ax)* f. 


Given a sufficiently small t, we set o(t) = Y;,X;Y_;X_,;p. The map o is 
schematically shown in Fig.5.2. As tf changes from 0 to some value fo, the point 
o(t) traces a curve joining the points p = p3 and pa. 


Theorem 5.27 For any smooth function f, 


f(o(t)) — fe) = Pax yf (p) + o(t*), 


km 2 (vt)) — f(o)) 
LL 


=0 : 
a ; x,y f (Pp) 


Proof Let us write the Taylor series for all sides of the parallelogram: 
f (pa) — f (pi) = tay f (pi) + Eyton + o(t*), 
f (pi) — f (po) = tax f (po) + E(x)? F(p0) + o(t*), 
f(p3) — f (po) = tax f(p2) + E(x)? F(p2) + o(t*), 


2. 
f (pr) — f (po) = tay f (po) + 5 (yy? f(p0) +o(1?). 
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The difference f(o(t)) — f(a (O)) = f(p4) — f (p3) can be represented in the form 


(f (pa) — f(p1)) + (Ff (pv) — f (po) — (Ff (ps3) — f(p2)) — Cf (p2) — f (po))- 


Up to o(¢”) this expression equals 


t (dy f (p1) — dy f (po)) —t (Ox f (p2) — Ax f (po)) = t7 (Ax ay — Ay Ax) f (po) +0(t7); 


the differences of the form 


t2 
5 (Ory fev — (ay)* f(po)) 


are of order o(t), and they can be ignored. Oo 


Now we proceed to the curvature. In calculating the curvature it is convenient to 
consider commuting vector fields, and therefore we assume that the vector fields X 
and Y commute. In this case, we have o(t) = Y;X;Y_;X_1p = p, i.e., the path 
along the sides of the parallelogram is closed (see Fig. 5.3). 

We need the following version of Taylor’s formula for the parallel transport of a 
vector along a curve. Let X be a vector field in a neighborhood of a curve y. We set 
V = y’(0) and denote the transport of a vector along the curve y to a point y(t) by 
t,. Then 


nok 
0X (Y(1)) = X(7(0)) = DI GVEX +00"), 
k=1 ~ 
Fig. 5.3. The closed path VY 
Py P 


Ph Pe 
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Indeed, let us apply Taylor’s usual formula to the function f(t) = tX(y(t)). To 
calculate 


X(y(t +h)) — mwX (v(t 
f'@) = lim TX (y(t + h)) — 0X (y( y 
h—>0 h 
we first transport the vector X (y(t + h)) to the point y (ft) and then transport both 
vectors T,X (y(t + A)) and X (y(t)) to the point y (0). As a result, we obtain 


uX(y(it+h)) — X(v@) 


'(t) = To li = 19Vy(t)X. 
f@ tole h TOVy'(t) 


Therefore, f*(t) = T(r) X) and f*(0) = Vi-X. 


Theorem 5.28 Let X and Y be commuting vector fields. Then the displacement 
during time t along trajectories of the fields —X, —Y, X, and Y (i.e., under the 
traversal p > p2 —> po > Pi — Pp) of a tangent vector Z at the point p is 


AZ =t’R(Y, X)Z +0(t?), 
and therefore 
li = R(Y, X)Z 
7 aaa 


Proof Let us denote the transport of a vector to the point p; along the corresponding 
arc by t; and the vector of a vector field Z at the point p; by Z;. As in the proof of 
Theorem 5.27, consider four differences: 


™1Z—Z,=tVyZi+ evi + o(t?), (5.8) 
t|Z1 — Zo =tVxZot+ Ev}zZ0 + o(t?), (5.9) 
™Z — Zy =tVxZ2 + Cvhz, + o(t?), (5.10) 
tZ2 — Zo =tVyZot+ CvbZ0 + o(t?). (5.11) 


Let us apply the parallel transport to to both sides of Eqs. (5.8) and (5.10): 


2 
t 
tmtmZ — |Z) =tt(VyZ1) + 5 10VyZ1) +f), (5.12) 


2 
mmZ — |Z. = tt(VxZ2) + 5 10V x Z2) dat", (5.13) 
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Summing (5.12) and (5.9) and subtracting the sum of (5.13) and (5.11), we 
obtain, after cancellations, tot; Z — tot2Z on the left-hand side. Next, 


ttoVy Z1 —tVyZo = t?VxVy Zo + o(t’), (5.14) 
ttoVx Zo —tVx Zo = t’ Vy Vx Zo + o(t’). (5.15) 


The differences £ (0V2Z1 _ V;Zo) and © (10V2Z2 _ Vi Zo) are of order o(t?), 
and we ignore them. As a result, we arrive at 


tt1Z — t2Z =t*[Vx, Vy]Zo + 0(t?). 


The transports tot; Z and tot2Z correspond to the traversals p — pi — po and 
DP — p2 — po, and we therefore denote them by t(p > pi — po)Z and t(p > 
p2 — po)Z. After we apply the transport t(p > p1 > po)! = t(po > pi > 
Pp) to both sides of the obtained equation, the left-hand side takes the form 


Z-—t(p> po> por pi > p)Z=—-AZ, 
and the right-hand side becomes, up to o(¢7), 


t(po > pi > p)t?[Vx, Vy]Zo = t?[Vx, Vy]Z =? R(X, Y)Z 


= —17?R(Y, X)Z. 


oO 


Remark Not only do Theorems 5.27 and 5.28 have similar proofs, but also 
Theorem 5.28 can be derived from Theorem 5.27; see [Sal]. 


Theorem 5.29 A Riemannian manifold is flat if and only if its restricted holonomy 
group is trivial. 


Proof First, suppose that the restricted holonomy group of a Riemannian manifold 
is trivial. Then by Theorem 5.28 we have AZ = 0 for all transports along 
parallelograms; therefore, the curvature tensor is zero. Thus, all sectional curvatures 
vanish, and hence the manifold is locally isometric to Euclidean space. 

Now suppose that a manifold is locally isometric to Euclidean space. A con- 
traction of a loop can be represented as a sequence of its transformations in 
neighborhoods isometric to domains of Euclidean space, and hence parallel trans- 
port along a contractible loop is the identity transformation. Oo 
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5.15 Solutions of Problems 


5.1 Recall (see p. 95) that, for any function f, we have 
[fW,V] = f[W,V]—-@vfoW, ([W, fV] = flW,V)+ @wf)V. 


We need these relations to find out the behavior of the right-hand side of (5.1) under 
the replacement of X, Y, and Z by fX, fY, and fZ (we are not interested in the 
terms which are obtained by multiplication by f). Namely, to begin with, we must 
check that under the replacement of Z by f Z there arise no terms containing dy f, 
dy f, or Oz f, i.e., the vector Vy Y defined by (5.1) does not depend on Z for fixed 
X and Y. Then, we must verify that V is a connection, i.e., first, the assignment 
of the vector field Vy Y to vector fields X and Y is linear in X and Y and, second, 
VexY = fVxY and Vxy(fY) = fVxY + (x f)Y for any function f. 

Under the replacement of Z by fZ the terms containing 0x f arise in the first 
and fifth summands. These summands are (Y, (0x f)Z) and ((—dx f)Z, Y). Their 
sum vanishes. The calculation for dy f is similar. Terms containing 07 f arise in no 
summand. 

The linearity of the operation V in X and Y is obvious. Under the replacement 
of X by fX, in addition to terms obtained by multiplication by /, there appear the 
terms 


(dy f)(X, Z) — (0z F(X, Y) — (oy fy(Z, X) + (8z f(X, Y) = 0. 


This agrees with the formula Vex Y = fVxY. 
Under the replacement of Y by fY, there appear the terms 


(Ox f)(Y, Z) — (Oz f)(X, Y) + (0x IY, Z) + (8z f)Y, X) = 2(0x fy, Z). 


This agrees with the formula Vy(fY) = fVxY + (x f)Y. 
The relations 


(VxY, Z) + (Y, VxZ) = Ox(Y, Z) 
and 
(Vx¥, Z) —_ (Vy X, Z) S ([X, Y], Z) 


are easy to verify directly from the definition. Therefore, the connection V is 
compatible with the metric and has zero torsion. 


5.2 Obviously, d(x, y) thus defined is nonnegative and symmetric. The triangle 
inequality holds because, given a curve joining points p and q and a curve joining 
points g and r, we can compose a curve joining p and r. It only remains to check 
that d(x, y) > Oif x ¥ y. To this end, we choose a chart U — R” at the point x. 
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Let (V, W) be the inner product of vectors V and W in R”, and let g(y)(V, W) = 
a(y)ij V'W! be the inner product of these vectors in the tangent space at the point 
y € U with respect to the given Riemannian metric. Consider a closed ball D of 
radius r centered at x which is contained in U. On the set of all pairs (y, V) with 
y € Dand||V|/? = (V, V) = 1 the function g(y)(V, V) attains a positive minimum 
C?. For any vector V # 0 and any number A ¢ 0, the ratios g(V, V)/||V||? and 
g(AV,AV)/||AV ||? are equal; therefore, given any nonzero vector in the tangent 
space at any point of the ball D, the ratio of its length in the Riemannian metric 
to its Euclidean length is at least C. Thus, if the point y lies inside this ball, then 
d(x, y) => C||x — y|l, and if it lies outside, then d(x, y) > Cr. 


5.3 It suffices to prove the statement for IR”. Let d) be the Euclidean metric of R”, 
and let d2 be a Riemannian metric on R”. If we prove that the metrics d; and d2 are 
related by c}d, < dz < cod), where c; and c2 are positive constants, then it will 
follow that they induce the same topology. In the solution of Problem 5.2 we showed 
that d2 > cid; in a closed ball D, using the fact that the function g(y)(V, V) has 
a positive minimum on the set of all pairs (y, V) with y € D and (V,V) = 1. 
To prove the inequality dz < c2d1, it suffices to note that this function also has a 
positive maximum. 


5.4 No. Take, for example, points x and y in the plane R? which are symmetric 
about the origin. Then these points cannot be joined by a smooth curve of minimum 
length on the connected Riemannian manifold R? \ {0}. 


5.5 Suppose that a geodesic y on the sphere S” passes through some point z and 
its velocity vector at this point equals V. Consider the plane which contains the 
vector V and passes through the center of the sphere and the point z. We can choose 
a coordinate system so that this plane is the coordinate plane Ox!x?, i.e., so that 
it is determined by the equations x> = --- = x"t! = 0. We must prove that 
the geodesic y lies entirely in the plane Ox!x*. Suppose that some point w of 
y has a nonzero coordinate x', i > 2. Consider the transformation which leaves 
all coordinates but x! intact and changes the sign of x'. The sphere, the point z, 
and the velocity vector of the geodesic at z are invariant under this transformation; 
therefore, so is the geodesic y. Thus, the transformation leaves the point w fixed. 
We have arrived at a contradiction. 


5.6 


(a) In the proof of Theorem 5.5 we have used only the fact that any geodesic 
through x can be extended without bound. 

(b) Let A be a closed bounded subset of M”. It follows from the boundedness of 
A that the distance from the point x to any point y in A does not exceed some 
number d. The points x and y can be joined by a geodesic of length at most d. 
Therefore, the set A is contained in the image of a closed ball of radius d under 
the map exp,., which is compact. The set A is closed, and hence it is compact as 
well. 
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5.7 According to Theorem 5.9, there exists a bounded Riemannian metric on M”. 
By assumption this metric is geodesically complete; hence the closure of a geodesic 
ball of any finite radius is compact. Since the metric is bounded, the whole manifold 
M” coincides with a geodesic ball of some finite radius. Therefore, M” is compact. 


5.8 Clearly, 
a 
ae ee Pw) 
a 
= R(Z,Y+B6BW,X+aZ,Y+ BW)+ R(X +aZ,Y+BW,Z,Y+ BW). 


Therefore, at the point a = 0, 6 = 0, 


2 


daap 


(R(X +a0Z,¥ + BW,X +aZ,Y + BW)) 


= R(Z, W, X,Y) + R(Z, Y, X, W) + R(X, W, Z, Y) + R(X, Y, Z, W) 
= 2R(X, Y, Z, W) +2R(Z, Y, X, W), 


a2 
—— (R(X +aW, Y + BZ,X +aW,Y + BZ)) 
dadp 
= R(W, Z, X,Y) + RWW, Y, X, Z) + R(X, Z,W, Y) + R(X, Y, W, Z) 


= 2R(X, Y, W, Z) + 2R(X, Z, W,Y). 


Moreover, R(Z, Y, X, W) — R(X, Z, W, Y) = R(Z, Y, X, W) + R(Z, X, W, Y) = 
—R(Z, W, Y,X) = R(X, Y, Z, W). 


5.9 The coordinates of the vector X = e, are X! =--- = X"”—-! = Oand X” =1; 
therefore, Ric(X, X) = Rij X' X/ =R = ae ae Rinkn, because 


nkn 


Rinnn = 9. It is also clear that K(e;, en) = Rjnjn, whence SS K(ej,en) = 
=] 
aa Rjnjn- 


5.10 In an orthonormal basis we have S = )“7_, Ric(ex, ex). Expressing the terms 
Ric(ex, ex) by the formula of Problem 5.9, we obtain the required result. 


5.11 Consider a geodesic y on the manifold M. Let V be the velocity vector of this 
geodesic at some point p. Under the isometry o all points of the curve y remain 
fixed, and hence V remains fixed under the map ox. 

Now consider a geodesic y on the manifold M passing through the point p and 
having the velocity vector V at this point. The isometry o leaves the vector V fixed, 
and hence it leaves fixed all points of the geodesic y. Thus, the curve y lies in M. 
This curve is locally minimal in M, because it is locally minimal even in M. Since 
the geodesics y and y in M have the same velocity vector at p, it follows that they 
coincide. 
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5.12 Consider, e.g., a geodesic y on the manifold M,. Let X be the velocity vector 
of this geodesic. Then Vy, X, = 0. Therefore, (5.7) implies Vx, X; = 0. Thus, the 
curve y is also geodesic on M. 


5.13 Consider a curve y on the manifold M = M, x Mp. Let us decompose its 
velocity vector into two components: X = X, + X2. Clearly, X; and X2 are the 
velocity vectors of the projections of y on M, and on Mo. By virtue of (5.7), we 
have 


Vxy4x9(X1 + X2) = Vy, X1 + Vy, X2. 


Therefore, a curve y on the manifold M is geodesic if and only if its projection on 
each of the factors is geodesic. 


5.14 Consider a triangle composed of arcs of great circles on the sphere. The 
parallel transport along its sides rotates a vector through an angle ofa+fB+y—zZ, 
where a, 8, and y are the angles of the triangle. The sum of angles of a spherical 
triangle is larger than z but less than 37; therefore, the angle of rotation can be 
arbitrary. 


5.15 


(a) Consider the tangent space to S? at a point p. Any direct motion of this vector 
space is a rotation about some axis. Let a be the plane perpendicular to this axis 
and passing through p. The intersection of the sphere S* with the hyperplane 
containing a and passing through the center of the sphere is the submanifold 
S? of S*. According to Theorem 5.20, the parallel transport along a loop lying 
in S* leaves fixed any vector tangent to S* at p and perpendicular to the plane 
a, and it transforms any vector in the plane a as if it were transported inside 
S?. According to Problem 5.14, we can realize rotation through any angle by a 
parallel transport along some curve on the sphere S*. On the sphere S? the same 
transport realizes the initial motion of the tangent space. 

(b) Consider the tangent space to S” at a point p. Any direct motion of this space 
can be represented as the composition of an even number of symmetries about 
hyperplanes. The composition of two such symmetries is a direct motion which 
leaves fixed a subspace of dimension n — 2. Let us show that the group Hol(S”) 
contains a subgroup consisting of direct motions leaving fixed this subspace of 
dimension n — 2. To this end, just as in the solution of part (a) of the problem, 
we take a plane a perpendicular to this subspace and consider the sphere S* 
being the intersection of S” with the subspace of dimension 3 containing a and 
passing through the center of the sphere. 


5.16 


(a) On a surface with flat metric, running round a sufficiently small loop is the 
identity map. A contraction of a loop can be represented as a sequence of 
its transformations in small domains. Therefore, on a surface with flat metric 
contractible loops correspond to the identity element in the holonomy group. 
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Fig. 5.4 Parallel transport 
along a generator of the 
fundamental group 


Vi V2 


V3 


(b) 


On the net of a cone consider parallel transport along a generator of the 

fundamental group of the cone (see Fig. 5.4a). Such a transport takes the vector 

V, to the vector V2, while on the net the vector V; is identified with V3. The 

angle between the vectors V2 and V3 equals the angle @ at the vertex of the net. 

(b) On the net of a Mobius band consider parallel transport along a generator of the 

fundamental group (see Fig. 5.4b). Such a transport takes the vector V; to the 
vector V2, while on the net V, is identified with V3. 


5.17 It is sufficient to consider the product of two Riemannian manifolds. Let y be 
a curve on a manifold M = M, x Mb, and let V be a vector field on this curve. 
We decompose the velocity vector X of y and the vector V into two components: 
X = X,+ X2 and V = V; + V2. We have 


VxV = Vy, Vi + Vy, Vo. 


Clearly, X; and X2 are the velocity vectors of the projections of the curve y on M 
and on M). Therefore, the vector field V is parallel along the curve y if and only if 
the projections of each vector V on the tangent spaces to Mj and to M2 are parallel 
along the projections of the curve y. This immediately implies the statement. 


Chapter 6 M®) 
Lie Groups peels 


In this chapter we discuss the differential geometry of Riemannian manifolds 
equipped with a group structure, i.e., the differential geometry of Lie groups with 
a Riemannian metric. On a Lie group of most interest are Riemannian connections 
consistent with multiplication, and above all, left-invariant metrics. 


6.1 Lie Groups and Algebras 


A Lie group G is a group which is simultaneously a smooth manifold on which the 
maps G x G > G and G > G given by (x, y) & xy and x + x7! are smooth. 


Problem 6.1 Prove that if the map (x, y) > x7!y is smooth, then so are both maps 


(x,y) xy andx ph x7!, 


The algebraic and geometric structures of a Lie group are involved in an object 
called a left-invariant vector field, which is important for many purposes. A left- 
invariant vector field is constructed by using left translations. The left translation by 
an element g in a Lie group is the map Lg(x) = gx. 

Any smooth map h: M — N of manifolds induces a map of certain objects 
associated with these manifolds. Moreover, objects can be transferred in two 
opposite directions. The induced map may transfer them in the same direction, i.e., 
from M to N. This is the case for vectors: a curve y on M is mapped to the curve 
h(y) on N. Insuch cases, the induced map is usually denoted by h,,. But the induced 
map may also transfer objects in the opposite direction, from N to M. This is the 
case for functions: a function f(n) on N is mapped to the function f(h(m)) on M. 
In such cases, the induced map is denoted by h*. Differential forms are transferred 
from N to M as well: a form w(X1,..., Xx), where X1,..., Xx are vectors on N, 
is mapped to the form h*w(Yj,..., Yk) = w(hsY,..., Ye), where Yi, ..., Ye 
are vectors on M. 
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A left translation, as well as any other map, induces actions on various objects 
associated with the Lie group G. For example, the action on the functions on the Lie 
group is defined by (Le A(x) = (f o Lg) (x) = f(Lg(x)) = f (gx). The action on 
the tangent vectors is defined as follows. Suppose that a tangent vector V, at a point 
x € G is determined by a curve y(t), i.e., y(O) = x and y’(0) = V,. Then Lgx Vx i8 
the tangent vector at the point gx determined by the curve gy (tf), i.e., Lex = dLg. 
These two actions are consistent with the action of vectors on functions, which is 
defined by V, f = dy, f = £f(yW)|r=o. Indeed, (Les Ve) f = $f (gy (O)l=0 = 
Villy): 

A vector field V is said to be left-invariant if any left translation leaves this vector 
field fixed, i.e., LgxVy = Vgx for all x ¢ G and all g € G. Each left-invariant vector 
field is completely determined by the vector V, at the identity element e. Indeed, the 
vector of this left-invariant vector field at any point x € G must equal Ly..Ve. And 
conversely, if we take a tangent vector V, at the identity element e and consider the 
vector field equal to L,,.Ve at the point x, then this vector field will be left-invariant. 
Indeed, Lex Vx = LyxLxxVe = LyxxVe = Vex. 


Problem 6.2 Prove that on a Lie group of dimension n there exist n vector fields 
linearly independent at each point. 


Consider the tangent space 7; G at the identity element e of a Lie group G. Let X- 
be a vector in this space, and let X be the corresponding left-invariant vector field. 
The Lie algebra g of the Lie group G is the space T,G endowed with the operation 
[Xe, Yo] = [X, Y]e; thus, to calculate the product of vectors X. and Y~ in the Lie 
algebra, we must extend them to left-invariant vector fields X and Y, calculate their 
commutator [X, Y], and take the value of this commutator at e. The multiplication 
operation [-, -] in the Lie algebra g is called commutator, or Lie bracket. 


Problem 6.3 Prove that the commutator of left-invariant vector fields X and Y isa 
left-invariant vector field. 


In addition to left translations Ly, we can consider right translations Ry (x) = xh. 
The action of a right translation on the tangent vectors is defined as follows. Suppose 
that a tangent vector W, at a point x € G is determined by a curve y(t), ie., 
y(0) = x and y’(0) = W,. Then Rp» W, is the tangent vector at xh determined by 
the curve y (t)h, 1.e., Rnx = d Rp. The vector field W is said to be right-invariant if 
any right translation leaves fixed this vector field. 

Let X. be a vector in the space TG. We denote the corresponding left-invariant 
vector field by X” and the corresponding right-invariant vector field by X*. 

It is easy to check that LgoLy = Lgn and Rgo Rn = Rng. Thus, the map g +> Ly 
is ahomomorphism of the group G to the group Aut(G), and the map g +> Rg is an 
antihomomorphism. It is also clear that Lz o Ry = Ryo Lg: each of these operations 
takes x to gxh. 

An operation on the tangent space T,G at the identity element e of the Lie 
group G can be defined by using not only left-invariant vector fields but also right- 
invariant ones. These are different operations: they differ in sign. In the solution of 


6.1 Lie Groups and Algebras 213 


Problem 6.4 this is proved for matrix Lie groups, i.e., for Lie groups consisting of 
matrices. For arbitrary Lie groups, this is proved in the solution of Problem 6.8. 


Problem 6.4 Given a matrix Lie group G, prove that the operation on the tangent 
space 7;G at the identity matrix J which is defined by using left-invariant vector 
fields is taking the commutator of matrices, and the operation defined by using right- 
invariant vector fields is taking the commutator with the negative sign. 


Problem 6.5 Prove that the differential of the multiplication 4: G x G > G is 
related to the differentials of left and right translations as (Vg, Wn) = RaxVg + 
L gx Wh. 


Consider the map Inv: G —> G given by Inv(g) = g7!. 


Problem 6.6 


(a) Prove that Inv, Vz = — g Ve g~! for a matrix Lie group G. 
(b) Prove that Inv, Vz = —(Lgx)! Ry-1,4V¢ for any Lie group. 


Problem 6.7 Let X" and X* be the left- and right-invariant vector fields corre- 
sponding to a vector X- € 7T.G. Prove that (Inv, X ys — a. : for any point 
geG. 


Problem 6.7 establishes a relationship between left- and right-invariant vector 
fields: the map Inv takes any left-invariant vector field to a right-invariant one. 


Problem 6.8 Prove that the right commutator differs from the left commutator in 
sign, ie., [X", ¥4]. = —[X®, VY? ]. 

Let F' be the flow of a vector field V on a Lie group G, which means that 
F': G > G is the displacement during time ft along a trajectory of V. 


Theorem 6.1 The flow of a left-invariant vector field is the right translation by the 
image of the identity element e, and the flow of a right-invariant vector field is the 
left translation; i.e., F' = Rprt(e) for a left-invariant vector field and F' = L Fre) 
for a right-invariant vector field. 


Proof Let V be a left-invariant vector field (the case of a right-invariant field is 
treated in a similar way). It is required to prove that F'(x) = R Ft(e)(%). Let us fix a 
point x and consider the curve 


x(t) = Rete (x) = xF'(e) = Ly (F'(e)). 


By assumption iF ‘(e) = V(F'(e)). The left invariance of V implies 
Lyx(V (F'(e)) = V(L,x(F'(e)). Therefore, x (0) = x and 


dx : . 
ra = Ly (V(F (e)) = ViLx(F' (e))) = Var). 


Thus, x(t) is the trajectory of the vector field V starting at x, i.e., x(t) = F'(x). O 
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Corollary The flow of any left-invariant vector field commutes with the flow of any 
right-invariant vector field (and, therefore, the commutator of a left-invariant vector 
field and a right-invariant one vanishes). 


Proof Let F' be the flow of a left-invariant vector field V, and let H* be the flow 
of a right-invariant vector field W. Then 


(F! o H*)(x) = (Rete) 0 Lys(e))(X) = H¥(e) +x» F'(e) 


= (Lus(e) 0 Rpt(e))(X) = (H* 0 F')(x). 


| 


A tangent vector X, at the identity element e € G determines a left-invariant 
vector field X/ and a right-invariant vector field X*. Through each point of the 
Lie group G a trajectory of the left-invariant field X" and a trajectory of the right- 
invariant field X* pass. These trajectories are generally different. But the trajectories 
passing through the identity element e coincide and determine (for sufficiently small 
t) a one-parameter subgroup, namely, the displacement of the identity element e 
during time ¢ along the trajectory. 

To prove the coincidence of the two trajectories, we use Theorem 6.1. First, we 
show that the trajectories of both vector fields are defined not only for sufficiently 
small ¢ but also for any ¢. For the flow of a left-invariant vector field, we have 
F'ts(e) = F*(F'(e)) = Rese) (F'(e)) = F'(e)- F*(e). A similar equation holds 
for the flow of a right-invariant vector field. This equation shows, in particular, that 
the flow F'(e) is defined not only for sufficiently small t, but also for any t. Indeed, 
given any t, the flow F’(e) can be specified by F'(e) = (F‘/"(e))", where n is large 
enough. The formula F’(x) = Rrtie)(x) shows that, given any f, trajectories of a 
left-invariant vector field passing not only through the identity element e but also 
through any point x are defined. 

Let F" be the flow constructed from a left-invariant vector field X/ or from a 
right-invariant vector field X*. We have proved that the map h: R > G given by 
h(t) = F'(e) is ahomomorphism. In either case, a) = X-. Differentiating the 
equations h(t + s) = h(t)h(s) and h(t +s) = h(s)h(t) with respect to s at s = 0, 
we obtain, respectively, 


il =L X.) = X4(h 

Fit) = Ena (Xe) = X" (HO) 
and 

dh 

GO) = Runa (Xe) = XR (a(t)). 


Therefore, h(t) is the trajectory of both X L and X® through h(0) = e. 
Using the one-parameter group F‘(e), we can define the exponential map 
exp: g > G by setting exp(X.) = F'(e), where F!(e) is the displacement of 
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the identity element e during time ¢ = | along the flow F’ of the left-invariant 
vector field X¥ (or of the right-invariant vector field X*) generated by the vector 
Xe. 
The differential of the map exp: g — G at the point 0 is the identity map g > g. 
Therefore, the map exp is a diffeomorphism between some neighborhood of the 
point 0 € g and some neighborhood of the point e € G. 


Example For the Lie group GL, of nonsingular matrices of order n, the Lie algebra 
is the algebra gl, of all matrices of order n, and 


A? Ak 
a a a er 


Proof It is easy to check that the function 


Act Akt 
CAC EP BE ag ea ee 
is the one-parameter subgroup corresponding to the tangent vector A. Oo 


By using the exponential map, it is possible to define multiplication in a Lie 
algebra in a different way. Let us identify a neighborhood of zero in the given Lie 
algebra and a neighborhood of the identity element in the Lie group by means of the 
map exp. Consider coordinates with origin at zero in the chosen neighborhood of 
zero; they induce coordinates in the chosen neighborhood of the identity element. 
In these coordinates multiplication in the Lie group can be decomposed as 


xy=xt+yt+ea,y)+..., 


where c(x, y) is some bilinear function and the dots denote higher-order terms. 
Indeed, we have xy = A+ Bx+Cy+.... Setting x = e, we obtain y = A+ 
Cy +..., whence A = Oand C = I. A similar argument proves that B = 1. 

The bilinear function c is also defined on the Lie algebra. The commutation 
operation in the Lie algebra can be specified by [X, Y]; = 2c(X, Y). A complete 
definition of this operation, with all identifications expressed explicitly, is as 
follows: 


1 
exp X -expY =exp(X + Y+ 5X, Yqi+...). 
It is easy the check that [X, Y]; = —[Y, X],. Indeed, 
1 
exp(0) = exp X - exp(—X) = exp(X — X + lx, —X],;+...), 


whence [X, X]; = 0. Therefore, [X + Y, X + Y]; = 0, which means that [X, Y]; + 
LY, X]; =0. 
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Theorem 6.2 For sufficiently small X,Y € g, 


exp(—X) exp(—Y) exp X exp Y = exp((X, Y]i+...), (6.1) 
exp X exp Y exp(—X) = exp(Y + [X, Y]i +...). (6.2) 


Proof Clearly, 
1 
exp(—X) exp(—Y) = exp(—X — Y+ 5X, Yjit...) =expA, 
1 
exp X expY = exp(X + Y+ 5lX, Yuit...) =expB, 


1 
exp A exp B = exp(A+ B+ anh +...). 


Moreover, 

1 1 
A+B=-X-Y4sIX, Yn t- 4 X$ V+ 51K Vit = 1K Yh +... 
and[A, B]} =[—X —Y,X + Y],+---=0+.... Therefore, 


exp(—X) exp(—Y) exp X exp Y = expAexpB 
1 
=A BA Bliss) 


= exp([X, Y]i +...). 
Next, 
1 
exp X exp Y exp(—X) = exp B exp(—X) = exp(B — X — 5lB, X), +...). 


Moreover, B— X = Y + [X,Y] +... and[B,X]) =[X+Y¥,Xh+--- = 
[Y, X]; +.... Therefore, 


1 1 
exp X exp Y exp(—X) = exp(Y + 5X, Yi; - aly, X|, +...) 


=exp(Y + [X,Y], +...). 
a] 


Now the equivalence of the two definitions of commutator in a Lie algebra, i.e., 
the identity [X, Y]; = LX, Y], follows directly from Theorem 6.3. 
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Theorem 6.3 For left-invariant vector fields X and Y, 
exp(—tX) exp(—tY) exp(tX) exp(tY) = exp(t?[X, Y|J+ o(t*)). 


Proof We use Theorem 5.27 (see p. 201), taking the identity element e for p. In this 
case, we have X;p = pexp(tX) for p € G, and therefore 


o(t) = exp(—tX) exp(—tY) exp(tX) exp(tY). 
For a function f on the Lie group with f(e) = 0, Theorem 5.27 gives 


f (exp(—tX) exp(—tY) exp(tX) exp(tY)) = t atx. Y] fet o(t?) 
= f(exp@LX, Y] + o@))), 


which implies the required equation. Oo 


Corollary For left-invariant vector fields X and Y, the velocity vector of the curve 
exp(—V1X) exp(—V1tY) exp(/1tX) exp(V/tY) 


at t = 0 equals [X, Y]. 


For a right-invariant vector field, we have X;p = exp(tX) p, so that 
exp(tY) exp(tX) exp(—tY) exp(—tX)) = exp(t?[X, YJ+ o(t?)). 
For right-invariant vector fields X and Y, the velocity vector of the curve 
exp(V/tY) exp(/tX) exp(—V1Y) exp(—VtX) 


at t = 0 equals [X, Y]. 


Example The Lie algebra of the group of nonsingular matrices is the algebra of 
matrices with commutator [X, Y] = XY — YX. 


Proof We must show that the decompositions of exp X exp Y and exp(X + Y + 
5(X Y — YX)) coincide up to second-order terms. It is easy to check that, up to 
second-order terms, both of them equal 


xX? y? 
I+X4+Y¥+—+XY4+—. 
2 2 

oO 


HISTORICAL COMMENT The theory of Lie groups and algebras was developed 
by Sophus Lie (1842-1899). He came to this theory through the study of partial 
differential equations. He hoped to find a theory of such equations similar to Galois’ 
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theory of algebraic equations. In 1873-1874 Lie began to develop the theory of 
continuous transformations of groups, already independently of his initial goal. 


6.2 Adjoint Representation and the Killing Form 


The map I, = R,-10 Lg = Lgo R,-1, which takes x to gxg! 


of a Lie group G. Such an automorphism is said to be inner. 

An adjoint representation of a Lie group G is the map Ad: G — Aut(g) defined 
by Ad, = (d/g)¢, that is, the differential of an inner automorphism at the identity 
element. 

For a matrix Lie group, Ad4(X-) = AX,A, Indeed, the map /,4 takes each 
curve X(t) representing a tangent vector X, to the curve AX(t)A7~!. The tangent 
vector to this curve at t = 0is AX,A~!. 

It is easy to check that 


, iS an automorphism 


gexp(tXe)g | = exp(t Adg(Xe)). (6.3) 
Indeed, both maps t + g exp(tX,)g7! and t +> exp(t Adg(X-)) are homomor- 
phisms, and their derivatives at t = 0 equal Ad,(X,). One-parameter subgroups of 
a Lie group having the same tangent vectors at t = 0 coincide. 


Problem 6.9 Prove that, for matrices, Ae’* A~! = e! AXAT! 


An adjoint representation of a Lie group G is a group homomorphism (a 
homomorphism from the Lie group G to the automorphism group of the Lie algebra 
g). Indeed, differentiating the equation Igh = Ig o J, by the chain rule, we obtain 
Aden = Adg o Ad. 

An adjoint representation of a Lie group relates the left-invariant vector fields to 
the right-invariant ones as follows: the left translation of a vector X. € T.G toa 
point g equals the right translation to this point of the vector Adg(X-). Indeed, we 
have LgxXe = Rex Ry-i,LgxXe = Rex(Re-1ybexXe) = Rex(Adg(Xe)). 

The adjoint representation of a Lie algebra g is the homomorphism ad of the Lie 
algebra g to the Lie algebra of endomorphisms of the space g defined by ady = 
(d Ad),(X). For example, in the case of a matrix Lie group, as a curve representing 
a tangent vector X € T.G we can take e’*. Then 


d d 
ady(Y) = —(Adyx(¥))) = —(e*Ye"*)| = XY -—YX =[X,Y]. 
dt <0 at 1=0 
Let us prove this formula in the general case. 


Theorem 6.4 The identity ady(Y) = [X, Y] holds for all X,Y € g. 
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Proof Using the decomposition of Theorem 6.2, we obtain 
Adexpirx) (exp(sY)) = exp(tX) exp(sY) exp(—1tX) = exp(sY + st[X,Y]+...). 


Therefore, 


dd 
adx (¥) = 7 7 Adexpirx) (exp(s¥)) 


s=0!t=0 


d 
= explo + stl, Y]+...) 


s=01t=0 
aie Fi 


d 
= —(Y+?[X,Y]+...) 
dt 1=0 


oO 


Problem 6.10 Given an X ¢€ g, prove that Adexp x = exp(adx), where exp X is 
the exponential map from the Lie algebra to the Lie group and exp(adx) is the 
exponential of a linear operator. 


The Killing form is the symmetric bilinear form B(X, Y) on a Lie algebra g 
defined by B(X, Y) = tr(ady oady). Bilinearity follows from the linearity of the 
map X +> adx, and symmetry follows from tr(A B) = tr(BA). 


Theorem 6.5 The Killing form has the following properties: 


(a) Each operator Adg is orthogonal with respect to B, ie, B(X,Y) = 
B(Ad,(X), Adg(Y)) forall g ¢ Gand X,Y €g. 

(b) Each operator adz is skew-symmetric with respect to B, i.e., B(adz(X), Y) = 
—B(X,adz(Y)) (this identity can also be written as B([X,Z],Y) = 
B(X,[Z, Y])). 


Proof 


(a) If o: g — g is an automorphism of a Lie algebra g, ie., o is a linear 
isomorphism and o[X,Y] = [oX,oY], then adgyoo = o 0 ady, ie., 
adgx = 0 oady oo !. Leto = Adg. Then 


B(Adg(X), Adg(Y)) = tr(adaa, (X) 0 adaa, (Y)) 
= tr(Ady oady 0 Adz! o Adg oady 0 Ad; ') 
= tr(ady oady) = B(X,Y). 
(b) Twice applying the Jacobi identity, we obtain 


[Z, LX, [Y, WI] = [[Z, X], [(Y, WI) + (x, [Z, LY, W))] 
= [[Z, X], YY, WI) + (X, 1Z, Y], WI) + LX, [Y, [Z, WI), 
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whence 
adz oady oady = adagy(x) 9 ady + ady o adagy(y) + ady oady oadz, 
1.€., 
[adz, ady oady] = adag,(x) 0 ady + ady 0 adaaz yy - 


Since tr([A, B]) = 0, it follows that B(adz(X), Y) + B(X, adz(Y)) = 0. oO 


HISTORICAL COMMENT Wilhelm Killing (1847-1923) classified semisimple Lie 
algebras between 1888 and 1890. For studying Lie algebras, he introduced a 
symmetric bilinear form on a Lie algebra; this form has subsequently become known 
as the Killing form. 


6.3 Connections and Metrics on Lie Groups 


In the context of Lie groups of special interest are connections and metrics consistent 
with the group structure in one sense or another, e.g., left-invariant connections and 
metrics. First, we discuss connections which are constructed without employing 
metric and may have nonzero torsion, and then we proceed to the Levi-Civita 
connections constructed for various metrics. 

On a Lie group G a canonical left-invariant covariant differentiation 7 V is 
defined. Its curvature tensor is zero, while torsion is not always zero. The one- 
parameter subgroups of the Lie group correspond to geodesics through the identity 
element. We can also define a symmetric covariant differentiation which leads to the 
same geodesics. 

The canonical left-invariant covariant differentiation _V is defined by setting 
LVxY = 0 for any left-invariant vector field Y (and any, not necessarily left- 
invariant, vector field X). This completely determines covariant differentiation. 
Indeed, any vector field Z on G can be represented as a linear combination of 
the basis left-invariant vector fields e;: Z = f'e;. Covariant differentiation obeys 
Leibniz’ rule; therefore, 


LVxZ = LVx fie: = firVxei + (Ox fe: = (Ox fei. 


For the canonical left-invariant covariant differentiation, the curvature tensor 7, R 
is zero, and the torsion tensor ,T(X, Y) equals —[X, Y], provided that the vector 
fields X and Y are left-invariant. To prove this, note that any vectors Xp, Yp, and 
Zp given at a point p can be extended to left-invariant vector fields X, Y, and Z. 
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Therefore, in calculating the curvature and torsion tensors at a given point, we can 
take left-invariant vector fields, and then all covariant derivatives vanish: 


LT (X,Y) =LVxY — Vy X —[X, Y] = -[X, Y], 
LR(X,Y)Z = (LVx)(LVy)Z — Vy )(LVx)Z — LVpx,y]Z = 0. 


For the canonical left-invariant covariant differentiation ; V on a Lie group G, 
an integral curve y(t) of a left-invariant vector field X is geodesic. Indeed, we have 
LVy'v' = LV,'X = 0. The curve gy(t) is an integral curve of X as well: 


(gy (t))' = (Lg)ay'(t) = (Le)eX (vO) = X(gy (0). 


At t = O the curve y(t) = y(0)~'y(t) passes through the identity element e. 
The geodesics y.(t) through the identity element e are one-parameter subgroups. 
Indeed, for a fixed fo, we have ye(to) Ye(t) = y(to + t), because both curves are 
integral curves of the vector field X, so that they coincide at t = 0. It follows, in 
particular, that the curve ye(t) can be extended without bound, i.e., it is defined for 
all t. Any geodesic has the form gye(t); therefore, any geodesic can be extended 
without bound. 
A symmetric covariant differentiation V on a Lie group G can be defined as 


1 
VxY=LVxY - zLT (x, Y). 


Theorem 6.6 


(a) The geodesics with respect to the covariant differentiations V and ,V coincide. 

(b) The torsion tensor T of the connection V is zero. 

(c) The curvature tensor R for left-invariant vector fields X, Y, and Z is calculated 
by 


R(X, Y)Z= lz. [X, Y]]. 


Proof (a) Clearly, V,y’ = LV,y’, because T(y’, y’) = 0. 

(b) and (c) The curvature and torsion tensors can be calculated by using left- 
invariant vector fields. For left-invariant vector fields X and Y, we have, by 
definition, 


1 1 1 
VxY =LVxY — BLT (x, Y)= ate, Y)= rie Y]. 
Therefore, 
T(X,Y) =VxY —VyX -—[X, Y] 
1 


1 
= 51X,¥]—51¥. X1-[X, ¥1=0. 
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Next, 


R(X, Y)Z = VxVyZ — VyVxZ — Vix,y|Z 


To derive the last equality, we have used the Jacobi identity. oO 


A left-invariant Riemannian metric on a Lie group can be constructed as follows. 
We choose a Euclidean metric (positive definite inner product) on the tangent space 
at the identity element (i.e., on the Lie algebra) and spread it over the entire group 
by using the left translations Ly: (LgxX, Lgx¥)g = (X, Y)e. For such a metric, any 
left translation is an isometry. Indeed, if follows directly from the definition that Lg, 
is an isometry from T.G to T,G and L,-1, is an isometry from TgG to T.G. The 
map Lg+|n: ThG — TgnG can be represented as the composition of the isometries 
TnG —> T.G and T.G — T,nG; therefore, it is an isometry as well. 

A Riemannian metric on a Lie group G is said to be bi-invariant if it is both 
left-invariant and right-invariant. 


Theorem 6.7 A left-invariant metric on a Lie group G is bi-invariant if and only 
if, for each g € G, the map Adg preserves the restriction of this metric to the Lie 
algebra g. 


Proof First, suppose that a metric on a Lie group is both left-invariant and right- 
invariant. Then both maps Lg and R,-1 are isometries of the Lie group. Their 
composition I, = Lg o R,-1 1s an isometry of the Lie group as well. The map 
Adg is the differential of Ig at the identity element, and hence the adjoint action on 
the Lie algebra is isometric. 

Now suppose that a metric on a Lie group is left-invariant and the adjoint action 
on the Lie algebra preserves this metric. Let us prove that the right translations 
preserve it as well. The map of the tangent space at a point h induced by the right 
translation Rg can be represented as the composition of the map of the tangent space 
at the identity element e induced by this right translation and the inverse of a map 
of the same form: 


(Re)eln = (Ragale o (Ridele) +. 


Indeed, we can map the curve y(t) for which y(0) = A first to the curve y(t)h7! 
and then to the curve (y (t)h-!)h g = y(t)g. Therefore, it suffices to prove that the 
right translation induces an isometry of the tangent space at the identity element. 
This is evident from the representation 


(Rg)x = (Lg)x 0 Adz! 
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Thus, the left-invariant metrics on a Lie group G are in one-to-one correspon- 
dence with the inner products in its Lie algebra g, and the bi-invariant metrics on G 
are in one-to-one correspondence with the Ad-invariant inner products in g. 


Theorem 6.8 Jf an inner product in a Lie algebra is Ad-invariant, then 
([X, Y], Z) = —(Y, [X, Z]) forall X,Y, Z eg. 


Proof The Ad-invariance of the given inner product implies 
(Adg(Y), Z) = (Adg(Y), Adg Ad,-1(Z)) = (Y, Ad,-1(Z)). 


Therefore, 


d 
(LX, Y], Z) = (adx (VY), Z) = (— Adexprx (Y) ,Z) 


t=0 


d d 
a Adexpt x (Y), Z) = a Adexp(—1x)(Z)) 


t=0 
(Y, — adx(Z)) = —(Y, [X, Z]). 


t=0 


oO 


Theorem 6.8 can also be formulated as follows: If all maps Adg are orthogonal 
with respect to some inner product in a Lie algebra, then all maps ady are skew- 
symmetric with respect to this inner product. For a connected Lie group, the 
converse is also true; we leave its proof to the reader as a problem. 


Problem 6.11 Prove that if a Lie group is connected and all maps ady are skew- 
symmetric, then all maps Ad, are orthogonal. 


Problem 6.12 Prove that, for a Lie group G with a bi-invariant metric, the map 
Inv: G > G defined by Inv(g) = g7! is an isometry. 


6.4 Maurer—Cartan Equations 


Consider a basis in a Lie algebra g. It consists of tangent vectors at the identity 
element e € G. Extending these vectors to left-invariant vector fields, we obtain a 
basis X; of left-invariant vector fields. It is associated with a basis w in the dual 
space: wi (Xj) = 5. 

The elements of the space dual to the space of left-invariant vector fields are 
differential 1-forms constant on each left-invariant vector field. Indeed, if a 1-form 
@ lies in the dual space, then it can be written as wm = a;w', and a left-invariant 
vector field can be written as X = b/X j (aj and b/ are some constants). Therefore, 
w(X) = ajb/ 5i = ajb'. Conversely, if a 1-form w is constant on each left-invariant 


vector field, then w(X;) = aj, whence w = aja! . 
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In a different way, the elements of the space dual to the space of left-invariant 
vector fields can also be described as left-invariant differential 1-forms on a Lie 
group. Before defining a left-invariant 1-form, we recall how maps act on 1-forms. 
Let f: M — N bea smooth map. If X is a vector field on M and m is a 1-form 
on N, then f,.X is a vector field on N, and f*w is the 1-form on M defined by 
(f*@)(X) = w(f,X). Thus, the value of the form f*@ at a vector X € T, M equals 
the value of the form w at the vector f, X ¢ Ty(,)N. 

A differential 1-form w is left-invariant if the left translation L takes it to itself: 
Leo = a, i.e., the value of the form w at a vector X € 7;,G equals its value at the 
vector LgxX € TyxG. 

Given a left-invariant vector field X, its value at a point gx equals the value of the 
vector field L,X at gx. Therefore, for a left-invariant 1-form @ and a left-invariant 
vector field X, m(X) takes the same value at all points. Now suppose given a 1-form 
w such that w(X) is constant for any left-invariant vector field X. Then the value of 
w at the vector Ly.X € Tg,G equals the value of w at X € 7,G, and therefore the 
form o left-invariant. 

The Maurer—Cartan form on a Lie group G with Lie algebra g is the left-invariant 
g-valued differential 1-form @ on G which takes the constant value X,¢ at each left- 
invariant vector field X (X¢ is the value of X at the point e). At an arbitrary vector 
X,g € T,G the form a takes the value (Lg-1)xX g € T.G. 

In addition to the left-invariant Maurer—Cartan form w!, for which w! (X a= 
(L go! )»Xg, we can consider the right-invariant Maurer—Cartan form awk , for which 
oo (Ky) = (R-t)s Xe. 

Using the Maurer—Cartan form @ (left or right), we can explicitly specify a 
trivialization of the tangent bundle 7;,G (i.e., a mapping 7;,G — G x g). Namely, 
to each tangent vector V, € T*G we can assign the pair (g, w(V¢)). 


Problem 6.13 Consider the composition of the map G x g — T,G inverse to 
the left trivialization and the right trivialization T,G —> G x g. Prove that this 
composition is the map (g, V) + (g, Adg(V)). 


Problem 6.14 Suppose given a basis X; of the space of left-invariant vector fields 
and a basis w! of the dual space. Let X¢; be the vector of the field X; at the point e. 
Prove that X.; ® w! = w (the Maurer—Cartan form). 


For a matrix Lie group G, the Maurer—Cartan form wg at a point g = (xi) can 
be written as g~'dg = (xi) ld (x4), where dg is the identity map of the tangent 
bundle. Indeed, we have wg (Vz) = Ly-14V¢ — g! Vy. 

If both vector fields X and Y are left-invariant, then w([X, Y]) = [X,VY]e = 
[Xe, Ye] = [w(X), w(Y)]; in the last two expressions, the brackets denote commu- 
tator in the Lie algebra. 

The differential of the 1-form w can be expressed in terms of its derivatives in 
directions of vector fields and the commutator as follows (see Appendix, p. 259): 


dw(X, Y) = dx(@(Y)) — dy(@(X)) — w([X, Y]). 


6.5 Invariant Integration on a Compact Lie Group 225 


For left-invariant vector fields, dx(@(Y)) = dy(@(X)) = O (the derivative of a 
constant quantity vanishes); therefore, 


dw(X, Y) = —w([X, Y]) = —[o(X), o(Y)]. 
To derive the equation 
dw(X, Y) + [w(X), o(¥)] = 0, (6.4) 


we used the left-invariance of the vector fields X and Y. To calculate the commutator 

of vector fields, we need the whole fields, rather that their values at a point. But the 

left-hand side of the resulting Eq. (6.4) depends only on the values of vector fields 

at a point. Therefore, this equation holds for any vector fields X and Y, rather than 

only for left-invariant ones. Equation (6.4) is called the Maurer—Cartan equation. 
For a matrix Lie group, we have (d w)', =d (o') and 


[o(X), OY], = (@(X)@(Y) — o(Y)o(X)); 
= 0(X),o(Y) — oY ),o(X)i 
= (Wj A §)(X, Y). 


Therefore, for a matrix Lie group, the Maurer—Cartan equation can be written in the 
form dw = —w Aw. 


HISTORICAL COMMENT Ludwig Maurer (1859-1927) obtained the Maurer— 
Cartan equation in 1899. Elie Joseph Cartan (1869-1951) obtained this equation in 
1904 and applied it in the method of moving frame, which he began to develop in 
1910. 


6.5 Invariant Integration on a Compact Lie Group 


In this section we use some basic facts concerning the integration of n-forms over 
n-dimensional manifolds. 

The volume form on a manifold M” is a nowhere vanishing n-form w. This 
form determines the orientation of M”. The volume form can be integrated over 
the manifold; i.e., for the volume form @, the integral £ mn @ can be defined. This 
integral obeys the change-of-variable rule: if g: M" — M” is a diffeomorphism of 
the manifold M”, then 
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where the plus sign corresponds to an orientation-preserving diffeomorphism and 
the minus sign, to an orientation-reversing one. 

Using the volume form, we can integrate functions on a manifold: /[- mti= 
if mn f @. When the volume form @ is replaced by ca, the integral of the function is 
multiplied by |c| (for negative c, orientation switches together with the sign of the 
form against which f is integrated). 

We will be interested in the case where the manifold M” is a compact Lie group 
G. All n-forms on an n-dimensional space are proportional, and hence there exists 
a unique, up to proportionality, left-invariant n-form on an n-dimensional Lie group 
G. On a compact Lie group G a left-invariant volume form for which /, Ge=l 
is determined up to multiplication by +1. We choose one of the two such forms and 


set 
[ t@ae= | r= f fo 


for every smooth function f. 


Theorem 6.9 The measure dg on a compact Lie group is left- and right-invariant, 
Lé., 


[ sasyae= f remas= [reas 
G G G 


foranyh eG. 


Proof The left invariance of the measure dg follows from that of the form w. To 
prove it, we need the equation (f o Ln) (Lio) = Li (fo), which holds because 
both these forms take the value f(hg)w(Lny»Xg) at the vector X,. Thus, 


[ fasas= [retme= fire tno) 
G G G 


= [| ticfey= | fo= f reas. 
G G G 


In addition to the equation given above, we have also used the change-of-variable 
formula (clearly, Ly, is an orientation-preserving diffeomorphism, because it pre- 
serves the volume form). 

Now let us prove the right invariance of the measure dg. First, we note that 
the transformations Lz and R, commute and the left-invariance of w implies 
that of Roo. The left-invariant form is unique up to proportionality; therefore, 
Roo = c(g)@, where c(g) is a nonzero real number. Since Rg o Ry = Ryg, it 
follows that the map g +> c(g) is a homomorphism from the compact group G to 
the multiplicative group of real numbers. A compact subgroup of the multiplicative 
group of real numbers consists of either the unique element | or the two elements 
+1; hence |c(g)| = 1. The map Rj, preserves orientation if and only if c(h) > 0. 
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Thus, 
/ fGWies / GoRposeds | (f 0 Ry)(Rkco) 
G G G 


= c(h) / Ri (fo) = c(h) sgn(c(h)) ii fo = / fo. 
G G G 


oO 


Example A l\eft-invariant n-form on a noncompact Lie group of dimension n may 
not be right-invariant. 


Proof Consider the group of affine transformations of the straight line T(q,,)(¢) = 
a+ At, where A 4 0. We have 
Ta nlow = Tarn + ut) =a+Arb+ rut, 


whence (a, 4)(b, “) = (a + Ab, Ap). 
Let us introduce coordinates (x, y) on this group and consider the 2-form w = 
— Clearly, 


Leary(%, y) = (4, A)(x, y) = (a + Ax, Ay), 
Roary, y) = (%, ya, 4) = & + ya, ya). 


* — (Adx)A(dy) _ * _ (dx+ady)A(Ady) __ 1 : 
Therefore, L(, 2 = ae and Ri, 2 = aye Orbe, 
the 2-form w is left-invariant but not right-invariant. Oo 


In the proof of Theorem 6.9 we defined a homomorphism c: G —> R for a Lie 
group G with a left-invariant form w by Ro = c(g)w. Given a compact connected 
Lie group G, we have c(g) = | for all g € G (for a disconnected compact Lie 
group, c(g) = +1). For the group of affine transformations of the straight line, 
c((a,A)) = f. 

In the proof of Theorem 6.10 we use the fact that a closed subgroup of a Lie group 
is itself a Lie group. A proof of this fact can be found in the book [Ad, pp. 17-19]. 


Theorem 6.10 On a Lie group G a bi-invariant Riemannian metric can be defined 
if and only if the set Ad(G) (the image of G under the map Ad) is relatively compact 
in GL(Qg) (i.e., the closure of this set is compact). 


Proof First, suppose that there exists a bi-invariant Riemannian metric on the Lie 
group G. Then, according to Theorem 6.7, the map Adg is orthogonal with respect 
to this metric for each g € G. Therefore, the set Ad(G) lies in the group O(g) of 
orthogonal transformations of the space g. This group is compact, and therefore so 
is the closure of Ad(G). 

Now suppose that the closure of the set Ad(G) is compact. By virtue of 
Theorem 6.7, it suffices to construct an Ad-invariant inner product in the space g. 
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Indeed, the left-invariant Riemannian metric determined by this inner product is also 
right-invariant. 

The closure of the set Ad(G) in GL(g) is a compact Lie group H. The group H 
acts on g, i.e., for any h € H and X ¢€ g, an element hX €¢€ g is defined. Choose 
an arbitrary inner product g(X, Y) in g and its average over H, i.e., consider the 
following inner product in g: 


(X,Y)= / (AX, hY)dh. 
H 


The invariance of integral follows from that of this inner product with respect to the 
action of any element t € H: 


ox.) = | (thx, nvyan = [ (hX, hY)dh = (X,Y). 
H HA 


In particular, this inner product is invariant with respect to the action of any 
element Ad, € H. Oo 


Corollary On any compact Lie group G a bi-invariant Riemannian metric can be 


defined. 
Proof The image of a compact group G under the map Ad is compact. Oo 


Problem 6.15 Prove that, for a compact Lie group G, the Killing form is negative 
definite. 


6.6 Lie Derivative 


The Lie derivative is the operator of differentiation of a tensor field along a vector 
field. The Lie derivative, as opposed to covariant derivative, does not depend on 
a Riemannian metric. In the next section, we will need the Lie derivative of a 
Riemannian metric. 

A vector field X on a manifold M” determines a transformation X, of M” for 
sufficiently small ft, namely, the displacement during time ¢ along an integral curve 
of the vector field X. To calculate the derivative of a tensor field S in the direction of 
a vector field X, we need some method for transporting the value of S in the space 
of tensors at a point X;(x) to the space of tensors at the point x = X g(x). Vectors 
are transported in same direction in which the map acts, and differential forms are 
transported in the reverse direction. Therefore, in the case of a vector field, we can 
use the transformation (X_,;), for transporting spaces of tensors, and in the case of 
a differential form, we can use the transformation (X;)*. 


6.6 Lie Derivative 229 


In the general case, to a tensor 


0 0 . ; 
= Gon 3 — Ji oe Jq 
Oxi ax'P Se ee 


we apply the transformation 
e 2 
(Xa @- @ Xe FT @ (Ki) “dx! @ +++ @ (Xi)*dx", 


We denote this transformation of a tensor S by X;S. For functions, we set 
(X* f)(x) = f (Xa). 

At a point x the Lie derivative (LxS)|x of a tensor field S in the direction of a 
vector field X is defined by 


CESS in OS OD 8 pean) 2 
t>0 t dt t=0 


Using this definition, we calculate the Lie derivative first for a function and then 
fora veel field and a 1-form. We assume that local coordinates x' are chosen and 


X = Xx! oa in these coordinates. 


The Lie derivative of a function is the directional derivative: Ly f = dxf. 
Indeed, 
af  d(Xr(x))! Of ¥j 
Lxf = FF o0)] = me eee ee 
dt 1=<0 Ox 


The Lie derivative of a Neclot field is the commutator of vector fields: LyY = 


[X, Y]. Indeed, if Y = Y'—, then 
i a ax!, a 
= | YO) —— 
1=0.0=— et Ox! Ox! rae 


ax!’ 
d . 0 
LxyY = —(X_1)« (5) 
dt x 
' ; oan a 8 ax 
Since Xo is the identity transformation, it follows that ; - 


=6 : . Moreover, 


—X. Therefore, 


= (x ¥ — =[X,Y]. 
( ax* “a Lt ¥] 
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Problem 6.16 Let X be a left-invariant vector field on a Lie group, and let Y be 
any vector field. Prove that 


Rexp(— Y, —Y, 
[X.Y] = lim (Rexp(—1X.))* Yexp(r Xe) e 
t-0 t 
The Lie derivative of a differential form w = oF dx/ is 


00; : ax! ; 
Lxyw = (Sx + a) dx! 


This is proved in approximately the same way as in the case of vector fields; the 
only substantial difference is that the proof uses the relation £Xx | <0 = X- By 
definition, 


ee “(x)" (wjdx') 


j 
a (~ (X4(x)) 2% ist) 


t=0 dt axk 


t=0 


ax} j d 
We also have eg = 6; and += X,|,_o = X. Therefore, 
t=0 
COP. dh ace axi doj .; , ax! j 
Lx = gt 6, dx Bo rt ae = age + Wa dx’. 


In calculating LyS, is it sometimes more convenient to deal with series 
expansions rather than with derivatives, i.e., take the following expression for the 
definition of the Lie derivative: 


XP S(X1(x)) = S(x) + t(LxS)|x + o(f). 


Let us calculate, e.g., the Lie derivative of a vector field Y in the direction of a vector 
field X by using the series expansion. For this purpose, consider the expression 


XPY(Xi(x)) = V(x) + t(LxY)|x + 0(0), 


where X;* = (X_,)«. We apply the operation (X;),. to both sides of this expression 
and take the first summand on the right-hand side to the left-hand side: 


Y (Xi) — (Xp)aY = t(X1)x(LxY) + of). 
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Consider the derivative of a function f in the direction of the vector field Y(X;) — 
(X1)4Y: 


9y(X)—(X)al f= Ov xy f — Wx) f 
= (dy f) 0 X; — dy(f o Xr) 
= dy f + tox dy f + o(t) — dy(f + tox f + o(t)) 
= t(0x dy f — dy dx f) + oft). 


Thus, we again obtain the formula Ly Y = [X, Y]. 
If T is a tensor of type (0, &), then, given any vector fields X, Yj,..., Yx, we have 


k 
(ExT), -- + Ye) = OxT M1, Ye) — DET EXYi--s Yad 
i=1 
In particular, the Lie derivative of a metric tensor has the form 
We have already proved this relation for a tensor of type (0, 1), i-e., for a 1-form. 


Now we give a proof using the series expansion. In calculating the Lie derivative of 
a vector field by using the series expansion we have obtained the equation 


Y(X1) = (Xi)aY + 1t(X1)a(LxY) + o(f). 
This equation implies 


(Xi)*T)(Y) = T(Xr)a¥) = TY (Xr) — t(X1)a(LxY)) + 0(0) 
= T(Y) 0X; — tT ((Xi)x(LxY)) + 0) 
= T(Y) + tax(T(Y)) — tT (Xt )x(LxV)) + o(0). 


Therefore, 


X,)*T)\(Y) —-T(Y 
(LxT)(¥) = lim “ t) ae Y) 


= lim (x (PY) — T((Xt)x(Ex¥))) 
= dx(T(Y)) — T(LxY). 


The general case differs only in that the notation is more complex. 
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HISTORICAL COMMENT Sophus Lie introduced only a special case of the Lie 
derivative, the commutator of vector fields. The Lie derivative in the general case 
was introduced by Wtadystaw Slebodziriski (1884-1972) in 1931 . 


6.7 Infinitesimal Isometries 


A vector field X on a Riemannian manifold M is called a Killing field if 
Lxg(Y, Z) = 0 for any vector fields Y and Z, i.e., the Lie derivative of the metric 
tensor in the direction of X vanishes. Recall that the Lie derivative of the metric 
tensor has the form 


Lxg(¥, Z) = dxg(¥, Z) — g([X, Y], Z) — g(Y, [X, Z]) 


(see p. 231). In terms of the Levi-Civita connection V, the equation Ly g(Y, Z) = 0 
can be written as 


e(Vy X, Z) + 8(Y, VzX) =0 


for any vector fields Y and Z. Indeed, the Levi-Civita connection is torsion-free and 
compatible with the metric, so that 


dx g(Y, Z) = g(VxY, Z) + g(¥, VxZ) 
= g(VyX + [X,Y], Z)+ 9(¥, VzX + [X, Z]). 

Therefore, 

Thus, if X is a Killing field, then VX (Y) = Vy X is a linear operator of Y skew- 
symmetric with respect to the metric g. 

A vector field X on a manifold M” is a Killing field if and only if the local 
one-parameter group of transformations X; (displacements by ¢ along integral 
trajectories of X) consists of isometries. Indeed, if all transformations X; are 


isometries, then g(X;Y, X/Z) = g(Y, Z), and therefore Ly g = 0. Now suppose 
that Ly g = 0. Then 


1 
lim —(g(XFY, X*Z) — 9(¥, Z)) =0. 
t>0 ¢ 
Since X;X5 = X;+45, we have 


. 1 
tim = (8 (X$4,¥, X$4,Z) — g(XFY, X$Z)) = 0. 
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Therefore, the map s + g(X*Y, X*Z) is constant, which means that Xs is an 
isometry. 
In view of this, Killing vector fields are also called infinitesimal isometries. 


Theorem 6.11 The commutator [X, Y] of Killing fields X and Y is a Killing field. 


Proof Recall that [X,Y] = LxyY = £((X—1)x¥ (X1(x)))|, 9: Therefore, for 
a fixed ft, the diffeomorphisms X_,Y,X;, generate the vector field [X,Y]. The 
diffeomorphism X _; Y, X; is a composition of isometries, and hence it is an isometry 
as well. It follows that 

a2 


L = —(X_,Y,X,)* = 0. 
[X,Y18 aaDe 1Y5Xt) §| 


Thus, the Killing fields form a Lie algebra. 


Problem 6.17 Prove that the right-invariant vector fields on a Lie group with a left- 
invariant metric are Killing fields. 


A Killing field X on a manifold M” is a Jacobi field along any geodesic on 
this manifold. Indeed, a Killing field X generates a local one-parameter group of 
isometries. Isometries take geodesics to geodesics. Therefore, any Killing field X 
generates a variation (with loose endpoints) of a geodesic in the class of geodesics. 

Recall that a Jacobi vector field J on a geodesic y is completely determined by 
the initial conditions J(0) and Vy J(0), where V = y’ is the velocity vector of the 
geodesic. Therefore, to define a Killing field, it is sufficient to specify its value at 
one point and the values of its covariant derivatives in all directions at this point. 
In particular, if a Killing field vanishes together with its covariant derivatives in all 
directions at some point, then it vanishes everywhere. 

The operator VX(Y) = VyX is a linear skew-symmetric operator of Y. 
Therefore, to define a Killing field, it suffices to specify a vector and a skew- 
symmetric operator. It follows that the dimension of the Lie algebra of Killing fields 
on an n-dimensional Riemannian manifold is at most n + ee) = merh 

The isometry group of a Riemannian manifold has many nice properties. 
This group is always a Lie group (by the Myers—Steenrod theorem, 1939). A 
generalization of this theorem was proved in the book [Pa]. In the same book it 
was proved that the natural action of the isometry group on the manifold is smooth 
and, moreover, a homomorphism f from the group of real numbers to a Lie group 
of isometries is smooth, provided that so is the map (f, p) +> B(t)p. We will not 
prove these theorems; instead, we will assume that all Riemannian manifolds under 
consideration have all these properties. 

Under this assumption, we establish a relationship between the Lie algebra of the 
isometry group of a Riemannian manifold M” and the Lie algebra of Killing fields 
on this manifold. 
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We use the following notation: G is the Lie group of isometries of a Riemannian 
manifold M” and g is its Lie algebra; X and Y are left-invariant vector fields on 
the Lie group G; and X, and Y, are the vectors of these vector fields at the identity 
element, in particular, X- and Y, are elements of the Lie algebra g. 

To an element X-¢ € g we can assign the one-parameter subgroup exp(fX¢) in G. 
The isometry exp(tX-) of the manifold M” takes each point p € M” to the point 
exp(tX-)p € M”. We associate the element X¢ of the Lie algebra g with the vector 
field a — 4 (exp(tXe) P) | ;—-9 on the manifold M". The one-parameter group X;" 
of displacements during time ¢ along integral trajectories of the vector field XT 
consists of the isometries exp(tX_), so that X* is a Killing field. 

For each point p € M”, we consider the map 7): G — M” that takes every 
element g € G to the point g(p) € M”. It is easy to check that (77p).Xe = Ae 
Indeed, the vector X- € TeG can be represented by a curve y(t) = exp(tX-) for 
which y’(0) = X¢. Therefore, the vector (p)xXe € T,)M" can be represented by 
the curve (7p 0 y)(t) = y(t)p = exp(tXe)p, ie., it equals £(exp(tXe)P)|,-o = 
ae 

Let us show that the map X, +> XT is a one-to-one linear map of the Lie algebra 
of the Lie group of isometries of the manifold M” to the Lie algebra of Killing 
fields on M”. This map is an anti-isomorphism with respect to the structure of the 
Lie algebra, i.e., [Xt, YT] = —[X, Y]*. 

The linearity of the map X, +> X* follows from (Tp)aXe = yu Let us check 


that it has trivial kernel. Indeed, suppose that a vector field XT is zero. Then its 
integral curves exp(tX.)p are constant for all p € M”. Therefore, exp(tX¢) is the 
identity element of the Lie group for all t, which means that X, = 0. Now let us 
check that any Killing field Z can be represented in the form Z = X* for some 
X.~ € g. The Killing vector field Z determines the one-parameter isometry group 
Z;. The one-parameter subgroup t +> Z, of the Lie group G corresponds to some 
element X, of the Lie algebra g; clearly, Xt = Z. 

Now let us prove that [X*, YT] = —[X, Y]*. Each vector X, € g determines the 
one-parameter group exp(tX-). This group coincides with the one-parameter group 
X;* of displacements during time f along integral trajectories of the vector field Xt. 
This one-parameter group can be used to calculate the commutator vector fields as 
the Lie derivative: 


E + yt a eae 
oy ya ag, 
> P t>0 t +30 ; . 


where gq = exp(tX-¢) p. 
Recall that Y7" = (1q)x(Ye). Therefore, 


exp(—1Xe)« Yr = exp(—1tXe)«(q)xYe = (exp(—tXe)1q)xYe. 
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Let us find out the structure of the map exp(—tX¢)7q: 


(exp(—tX¢)1q)(g) = (exp(—t Xe) Texpir x.) p) (8) 
= exp(—tXe)g exp(tX¢)p = ( p Rexp(r Xe) Lexp(—tX)) (8); 


therefore, exp(—tXe)%q = Wp Rexp(tX,) Lexp(—1X-): 
The left invariance of the vector field Y implies (Lexp(—rx,))xYe = Yexp(—rx,)> $0 
that 
exp(—tX.)4¥j7 = (exp(—tXe)1q) «Ye = (1p Rexp(t Xe) Lexp(—1Xe)) «Ye 


= (Tp) (Rexpirx,))* Yexp(—rX.)- 


Thus, 


exp(—tXe)4¥ji — Yo = (tp) *(Rexp(rxe))*Yexp(—tXe) — (Tp) Ye 


= (Tp) al (Rexpitx,))*Yexp(—txX.) — Yel. 
According to Problem 6.16, 


lim (Rexp(tX,))*Yexp(—1X,) — Ye 


t—0 t 


= [-X, Ye. 


Therefore, the equation [X*, Yt], = —[X, ely follows from (zp)x{—X, Yle = 


HISTORICAL COMMENT In 1884 Killing studied geometries with certain special 
properties related to infinitesimal transformations and introduced Killing fields. He 
proved that Killing fields form a Lie algebra, which had led him to the notion of a 
Lie algebra independently of Sophus Lie but 15 years later. 


6.8 Homogeneous Spaces 


Let G be a Lie group, and let K be its closed Lie subgroup. The set 
G/K = {gK\|g € G} 


of left cosets can be endowed with the structure of a manifold. Such a manifold is 
called a homogeneous space. We denote it by M and the point of M corresponding 
to the class eK = K byo. 

On the manifold M the Lie group G acts by the translations t,, where a € G. A 
translation tT, takes a point gK to the point agK. Let us check that the point ag K 
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depends only on gK and the element a. If g1K = g2K, then gj = g2k for some 
k € K; therefore, ag; K = agzk K = ag2K. This action is transitive, i.e., given any 
two points of the manifold M, one of them is mapped to the other by some element 
of the group G. 

The isotropy group of a point m € M with respect to the action of the group G 
is defined as the set of those a € G for which am = m. For the point 0, the isotropy 
group is the group K, and for a point gK, the isotropy group is the group gKg™!. 

A homogeneous Riemannian space is a Riemannian manifold M on which its 
Lie group G of isometries acts transitively. A homogeneous Riemannian space has 
the form G/K, where K is the isotropy group of some point. Indeed, choose a point 
m € M and let K be the isotropy group of this point. Then the map G/K — M 
which assigns the element gm € M to every class gK € G/K is bijective. 


Example 6.1 The sphere S" C R”*! is a homogeneous Riemannian space of the 
form O(2 + 1)/ O(n) = SO(# + 1)/ SO(n). 


Proof The isometry group of the sphere S” is the group O(n + 1) of orthogonal 
transformations. It acts transitively. The isotropy group of the point (1,0,...,0) € 


S” is the subgroup O(n) C O(2 + 1) consisting of all matrices of the form (( a) 
oO 


In Euclidean and hyperbolic spaces and the sphere the isometry group acts 
transitively not only on points but also on each tangent space. Homogeneous spaces 
are not so symmetric: for these spaces, only the action of the isometry group on 
points is required to be transitive. In a homogeneous space all points have identical 
geometric properties, but the geometries in different directions at a point may be 
different. 

Consider the natural projection p: G — G/K mapping each element g € G to 
the point gK € G/K. Let us calculate the differential p.|e: g — To(G/K) of this 
map at the point e. Let X € g. The one-parameter group exp(tX) represents this 
vector; therefore, 


d d 
pxX = Ar cee) = potas) 


t=0 t=0 


Problem 6.18 Prove that Ker p,|¢ = € is the Lie algebra of the Lie group K. 


The map px|e: g > To(G/K) is surjective; therefore, the relation Ker px|e = € 
implies the isomorphism 7,(G/K) = g/t. 

A homogeneous space G/K is a generalization of a Lie group G: a homogeneous 
space G/K is a Lie group in the special case where K is the trivial subgroup. The 
geometry of homogeneous spaces is most similar to that of Lie groups in the case of 
reductive homogeneous spaces. A homogeneous space G/K is said to be reductive 
if there exists a subspace m C g such that g = €@ mand Ad, m C mforallk € K. 
If a homogeneous space G/K is reductive, then the map px|e: g > To(G/K) 
induces the canonical isomorphism m = 7,(G/K), which is a generalization of the 
canonical isomorphism g = 7,(G). 
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Problem 6.19 


(a) Prove that if a homogeneous space G/K is reductive and g = € © m is the 
corresponding Ad(K )-invariant decomposition, then [€, m] C m. 

(b) Prove that if K C G is a connected closed subgroup and m C g is a subspace 
for which g = € @ m and [€, m] C m, then the homogeneous space G/K is 
reductive. 


For example, a homogeneous space G/K is reductive if the subgroup K is 
compact. For m we can be take the orthogonal complement to € with respect to 
an Ad(K )-invariant inner product on g. An Ad(K )-invariant inner product on g can 
be constructed by using invariant integration over the compact group K. Namely, 
we can take any inner product g(X, Y) on g and average it over the group K, i.e., 
consider the integral 


(X, y) =f s(Ad(X), Ads (dk. 


The inner product thus obtained is invariant with respect to Ad, for all k ¢€ €. 
Yet another example of a reductive homogeneous space arises in the following 
situation. 


Example 6.2 Let K be a closed subgroup of a connected Lie group G, and let o be 
an involutive automorphism of G leaving all elements of K fixed and such that the 
connected component of the fixed point set containing the identity element lies in 
K. Then G/K is a reductive homogeneous space. 


Proof Let us show that (1) € = {X € gldo(X) = X}, (2) g = € @ m, where 
m = {X € g|do(X) = —X}, and (3) Adg(m) C m forallk € K. 


(1) Since the map o leaves all elements of K fixed, it follows that da(X) = X 
for X € €. Suppose that do(X) = X and consider the one-parameter subgroup 
y(t) determined by the vector X. The curves y and o o y have the same initial 
velocity vector; therefore, 0 o y = y, i.e., all points of the curve y are fixed. 
Since the curve y passes through the identity element of G, it follows that it 
lies in the connected component of the fixed point set containing the identity 
element, and hence it lies in K. Thus, X ¢€ €. 

Any element X € g can be represented in the form X = X¢ + Xm, where 
X¢ = 4(X +do(X)) and Xm = 4(X —do(X)). Since the map a is involutive, 
it follows that so is do, which implies do (X¢) = X¢ anddo (Xm) = —Xm. 
We must show that, given X € mandk e€ K, we have do(Ad(X)) = 
— Adj (X). The equation o(k) = k implies o (kxk~!) = ko(x)k7!; therefore, 
do and Ady commute. Thus, 


(2 


New 


(3 


wm 


doa (Adg(X)) = Adg(da(X)) = Adk(—X) = — Adj (X). 
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The reductive homogeneous spaces in Example 6.2 are symmetric (see Sect. 6.9). 

Let M = G/K be a reductive homogeneous space, and let g = € © m be the 
corresponding Ad(XK)-invariant decomposition. The translations tg(gK) = agK 
act on the manifold M. Given k € K, the point o € M is fixed under the action 
T;; therefore, the map dt, acts on T,(M). It follows from the Ad(K )-invariance of 
the decomposition that the group Ad(K) acts on m. Let us show that the projection 
p: G — G/K takes the group Ad(K) acting on m to the group of transformations 
of the form dt, k € K. It is easy to check that t% o p = po Jk, where kk(g) = 
kgk!. Indeed, %(p(g)) = t%(gK) = kgK and p(k(g)) = p(kgk7!) = kgk. 
Differentiating the equation t,o p = po J at the point e € G, we obtaindt odp = 
dp o Adk. 


Theorem 6.12 Let M = G/K be a reductive homogeneous space, and let g = 
€@ m be the corresponding Ad(K )-invariant decomposition. Then the requirement 
that the map dp: m — T,(M) is a linear isometry establishes a one-to-one 
correspondence between the Ad(K)-invariant inner products on m and the G- 
invariant metrics on M. 


Proof Consider an Ad(K)-invariant inner product (, ) on m. The condition that 
the map dp: m — T,(M) is a linear isometry defines an inner product (, )o on 
T,(M). The projection p: G > G/K takes the group Ad(K) acting on m by linear 
isometries to the group of transformations of the form dt, k € K. Therefore, for 
each k € K, the transformation t,: T,(M) — T,(M) is a linear isometry. 

We use this fact to extend the inner product (, ), to the whole manifold M ina 
G-invariant way. Let us check that if t,(0) = t)(0) = x for some a, b € G, then the 
linear isomorphisms T,-1, T,-1: Ty M) — T,(M) transform the inner product (, )o 
into the same inner product (, ), on T,;(M). Indeed, it follows from t,(0) = T,(0) 
thataK = bK,i.e., ba=kek, Therefore, t,-1 0 T = T, is a linear isometry 
of the space 7,(M), and 


(t,-1(U), T,-1(V oo = (dt 0 T,-1(U), dT 0 T,-1(V) )o 
= (t-1(U), T-1(V))o. 


The Riemannian metric on M obtained by such a spread of inner product is G- 
invariant. 

Conversely, if on M a G-invariant Riemannian metric (, ) is given, then the 
transformations dt,|), where k € K, are linear isometries. The map dp» must be a 
linear isometry. Therefore, according to the remark before the theorem, the map dp 
transforms (, ), into an Ad(K )-invariant inner product on m. oO 


Theorem 6.12 shows that G-invariant metrics on a reductive homogeneous space 
G/K are a generalization of left-invariant metrics on the Lie group G. An analogue 
of Lie groups with a bi-invariant metric is naturally reductive homogeneous spaces. 
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These are reductive homogeneous spaces M = G/K with a G-invariant metric 
whose restriction to the subspace m has the property 


forall X¥, Y,Z Em. 

For a naturally reductive homogeneous space, geodesic lines can be described. 
We give this description only briefly; detailed proofs can be found in [ON2]. 

The subspace m C g is equipped with an inner product. We extend it to the 
whole space g = m@ € as follows. Choose an arbitrary inner product in space € and 
assume that €_Lm. This defines a left-invariant metric on G with respect to which all 
elements of € are vertical (tangent to the fibers of the bundle p: G — G/K) and all 
elements of m are horizontal (perpendicular to the fibers). 

To describe geodesics, we need the notion of a Riemannian submersion. A 
submersion is a smooth map of smooth manifolds whose differential is surjective 
at each point. A submersion p: M — B of Riemannian manifolds is called a 
Riemannian submersion if the map (Kerdp)+ —> TB induced on each tangent 
space is an isometry, i.e., the horizontal tangent subspace at each point x € M 
is mapped isometrically to the tangent space at the point p(x) © B. Under a 
Riemannian submersion the horizontal geodesics on the manifold M are mapped 
to geodesics on B. For a naturally reductive space, the map G/K — M isa 
Riemannian submersion. The one-parameter subgroup y(t) of G determined by a 
tangent vector X € m is horizontal. Therefore, the projection p: G > G/K = M 
takes this geodesic to a geodesic on the homogeneous space M. 


6.9 Symmetric Spaces 


Consider a complete connected Riemannian manifold M”. Given a point p € M”, 
an isometry sp: M" — M" is called a geodesic symmetry centered at p if 
Sp(expy (t)) = expy(—1) for allt ¢ R and V € T,M". The point p is an isolated 
fixed point of the geodesic symmetry s,, and cP is the identity map. The differential 
of the isometry s, at the point p is the identity map with the minus sign. 

A Riemannian manifold M” is called a symmetric space if, for any point p € M", 
there exists a geodesic symmetry centered at p. 


Remark The equation sp (expy (¢)) = expy(—t) completely determines a self-map 
Sp of a complete connected Riemannian manifold, provided that there exists a map 
satisfying this equation. However, such a map does not exist if expy (t1) = expy (2) 
and expy(—t!) 4 expy(—!). Moreover, even if such a map exits, it is not 
necessarily an isometry. 


Theorem 6.13 A symmetric space is a Riemannian homogeneous space. 
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Proof We will restrict ourselves to the proof that the isometry group of a symmetric 
space acts on it transitively; we will not prove the Myers—Steenrod theorem that the 
isometry group of a Riemannian space is a Lie group. 

Take any geodesic of finite length with endpoints p and q and consider the 
symmetry about the point dividing this geodesic into two parts of equal length. This 
isometry takes the point p to q. By definition any symmetric space is connected, 
that is, any two of its points can be joined by a curve consisting of several geodesic 
links. One endpoint of each link can be isometrically mapped to the other one; thus, 
any point of a symmetric space can be transferred to any other point by an isometry. 

oO 


Theorem 6.14 The reductive homogeneous space M = G/K in Example 6.2 
equipped with any G-invariant Riemannian metric is a Riemannian symmetric 
space, and the geodesic symmetry € centered at o satisfies the relation €0 p = poo. 


Proof Amap¢: M — M satisfying the relation ¢ o p = poo exists and is unique. 
Indeed, let us set ¢(pg) = p(og) for any g € G and check that this map is well- 
defined. If pg; = pgz, then gj K = g2K. Since the map o leaves fixed all elements 
of the group K, it follows that o(g1)K = o0(g2)K,ie., po(gi) = po(g2). 

The map ¢ is smooth (this follows from the existence of a local section of the 
submersion p). The map o is an involution, and hence so is ¢. 

Let us show that df = — id at the point o. Clearly, ¢(0) = o. If Y € T(M), 
then, as we showed in considering Example 6.2, there exists a Y € g for which 
da(Y) = —Y and dp(Y) = y. Therefore, 


d¢(y) = dg(dp(Y)) = dp(do(Y)) = dp(-Y) = —y. 
We have tog = ¢Tg¢ for all g € G. Indeed, for any x € G, 


fT p(x) = $p(gx) = po(gx) = p(og - ox) = tog P(X) = Togh p(X). 


The map ¢ is an isometry for any G-invariant metric on M. Indeed, setting V, = 
dT,-1 V € T,(M) for each V € T,(M), we see that 


(de(V), dg(V)) = (dE dtg(Vo), dE dtg(Vo)) 
= (dtagdf(Vo), dTogdE(Vo)) = (df(Vo), dE (Vo)) 
= (-VW, —Vo) =(V,V). 
It is also clear that if a Riemannian homogeneous space has a geodesic symmetry 


¢ with respect to o, then it has a geodesic symmetry with respect to any point p = 
T(o), namely, t¢t~!. oO 


HISTORICAL COMMENT In 1926 Cartan began to study Riemannian symmetric 
spaces; he classified them by 1932. 
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6.10 Solutions of Problems 


6.1 For the constant y = e (the identity element of the group), the map (x, e) 
x~le = x7! is smooth; therefore, so is the map x +> x~!. The composition 


(x,y) (xT! y)  (x7!)o!y = xy is smooth as well. 


6.2 The required vector field can be obtained by taking n linearly independent 
vectors in the tangent space at e and extending them to left-invariant vector fields 
on the Lie group. 


6.3 It is required to prove that if vector fields X and Y are left-invariant, then 
LexlX,¥]p = [X,Y] gp for all p € G and all g ¢€ G. To this end, it suffices to 
check that the actions of both vectors on a function f yield the same result. Clearly, 


(LexlX, Vp) f = [X, V1 p(f o Lg) = XpV(F 0 Lg) — Yp(X(f 0 Le) 
= X)/(V(f) 0 Lg) — Yp(X(f) o Lg) 
= Lex(Xp)V(f)) — Lex(Vp)(X(f)) 
= Xep(V(f)) — Vep(X(P)) = CX, Vey) f- 


6.4 Suppose that a tangent vector V at the identity matrix J is determined by a 
curve V(t), ie., V(0) = J and V’(0) = V. For example, we can take the curve 
V(t) = 14+ tV. Ata point X ¢€ G the left and right translations of this vector 
are determined by the curves XV(t) = X + ¢tXV and V(t)X = X +1tVX. The 
left-invariant commutator of vectors V and W is the commutator at the point J of 
the vector fields given at X by the curves X + tXV and X + tXW, and the right- 
invariant one is the commutator of vector fields determined by the curves X + tV X 
and X + tWXx. 

Let us calculate these commutators. Note that the action of a vector field on the 
linear functions completely determines this field, and therefore it suffices to consider 
linear functions. Take the linear function f4(X) = AX. On the curve X +1X W this 
function equals AX + tA XW; the derivative of this expression with respect to f is 
AXW. Thus, under the action of the left-invariant field W on fa we obtain the 
function g(X) = AXW. The action on this function of the left-invariant vector 
field V“, which is determined by the curves X + tXV, is as follows. First, the 
expression A(X + tXV)W = AXW + tAXVW is obtained; differentiating this 
expression with respect to f and setting X = J, we obtain AV W. It follows that the 
left-invariant commutator acts on f4 as f4  A(VW — WV), which corresponds 
to the commutator of the matrices V and W. 

Now let us calculate the commutator of right-invariant vector fields. The vector 
field W* acts of the linear function f4(X) = AX as follows: first, the expression 
fa(X+twx) = AX+tAWX is obtained, and then its derivative with respect to ¢ is 
taken, which equals AW X. Thus, the action yields the function g(X) = AWX. To 
find the result of the action of the vector field V* on this function, we differentiate 
the expression AW(X+tVX) = AWX+tAWVX with respect to t and set X = /. 
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As aresult, we obtain AW V. It follows that the right-invariant commutator acts on 
Aas fa t+» A(WV — VW), which corresponds to the commutator of the matrices 
V and W with the minus sign. 


6.5 Consider the curve y(t) = (1(t), y2(t)), where y1(0) = g, y; (0) = V,, 
y2(0) = h, and y3(0) = W,. The map uw takes this curve to the curve yj (t)y2(t). 
The tangent vector to this curve at t = 0 equals 


d d d 
TNOMO)| = ZOOM) + TERO)] = RieVe + LeeWh. 


6.6 


(a) Suppose that a vector V, is represented by a curve X(t), 1.e., X(0) = g and 
X'(0) = Vg. Then the vector Inv,. Vz is represented by the curve X —l(t). Let us 
differentiate the equation X (t)X —l(t) = I, where J is the identity matrix. For 
t = 0, we obtain Veg7! + g Inv, Vz = 0,i., Inv, Vg = —g!Vegr!. 

(b) Let us apply the formula of Problem 6.5 to the case where h = g~! = Inv(g) 
and W, = Inv, Vg. We have y(g, Inv(g)) = e, which is a constant quantity, 
whence [x(Vz, Inv, Vg) = 0, ie., Ry-1,V¢ + Lgx Inv, Vg = 0. Therefore, 


Inve Vz = —(Lge) | R,-1, Vs. 


6.7 Using the formula of Problem 6.6, we obtain (Inv, X ig = Inv, (X 1) = 
—(Lg-ty)” | Rex(X7 1) = Sag) Reha Me Since the operations L,, and 
Rnx commute, it follows that (Lit, ) Reply Xe = Re) “Lyi Xe a 
Rake XP. 


6.8 Consider the vector We = [X“,Y#]e. It generates the left-invariant vector 
field W’ and the right-invariant vector field W*. According to Problem 6.7, 
we have (Inv,[X“,Y“]). = (Inv.,W4). = -(W®), = -[X4,Y"]. and 


(Inv,[X4, Y“]). = [Inv, X“, Inv, Y4]. = [-—X*, -—Y*], = [X*, Y*]e. The left- 
hand sides of these equations are equal, and hence so are the right-hand sides. 


6.9 The required identity follows from (6.3), because Ad4(Xe) = AX,A~! fora 
matrix Lie group. It can also be easily derived directly from the series expansion of 
the matrix exponential by using the identity (AX A~!)” = AX”A7!. 


6.10 We set A(t) = Adexpix) and calculate the derivative: 
P d 
A(t)Y = Pree exp(—tX)) 


= X exp(tX)Y exp(—rX) + exp(tX)Y exp(—tX)(—X) 
= (adx) Adexpwx) Y = (adx)A(d). 
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Thus, A’(t) = U A(t), where U = ady and A(0) = e. The unique solution of this 
equation has the form A(t) = exp(tU). For t = 1, we obtain the required identity. 


6.11 Ina sufficiently small neighborhood U of the identity element of a Lie group 
G, for each g € U, we can choose a unique X € g satisfying the condition g = 
exp(X). Recall that Adexpcx) = e@dx according to Problem 6.10. Therefore, Adg = 
Adexp(x) = e*4x The skew symmetry of the matrix ady implies the orthogonality 
of the matrix Adg = ene, 

In aconnected Lie group any element can be represented as a product of elements 
in a neighborhood of the identity element. Clearly, the product of orthogonal 
matrices is an orthogonal matrix. 


6.12 It was shown in the solution of Problem 6.6 that Inv, Vz = —(Lgx)~ : Ro-14Ve- 
By assumption both maps R,-1,, and Lg, are isometries. 


6.13 The composition under consideration is given by (g,V) re 
(g, (Ry-1)x(Lg)«V) = (g, (Ro-1Lg)«V) = (g, Adg(V)). 


6.14 Consider a left-invariant vector field a/X j- The values of both the left- 
invariant form @ and the left-invariant form X,; ® w' at this vector field are equal 
to al X ej: 


6.15 Since the Lie group G is compact, we can choose an Ad-invariant inner 
product in the space g. With respect to this inner product each operator ady is skew- 
symmetric. The matrix (a;;) of this operator in an orthonormal basis of the space g 
is skew-symmetric, whence 


B(X, X) = tr(ady oady) = > a ajay = — > ap <0. 
ij i,j 


6.16 According to Theorem 6.1, the displacement during time —t along a trajectory 
of a left-invariant vector field X starting at e is the transformation Rexp(—rx,)- 
Therefore, the required relation follows directly from the fact that the commutator 
of vector fields is the Lie derivative. 


6.17 According to Theorem 6.1, the flow X; of a right-invariant vector field X on 
a Lie group is the left translation by the image of the identity element, i.e., X;(g) = 
exp(tX)g. On the Lie group with a left-invariant metric the transformation X; is an 
isometry for each t. Therefore, X is a Killing field. 


6.18 If X © €, then exp(tX)K = K forall t. Therefore, p,.X = 0. 
Now suppose that p,.X = 0. This means that, for any function f on G/K, we 
have 


d 
7) = 0. 


t=0 
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In particular, this equation holds for the function f,(gK) = f(exp(sX)gk), 
whence 


o 
| 


d 
= qe) 


t=0 


= < flexp(sX) exp(tX)K) 


t=0 


< flexpls +1)X)K) 


t=0 


Letr =s+t. Then 


d d 

—flexprxX)K)| = —f(exp((s+1)X)K) 
dr r=s dt t=0 

Thus, f(exp(rX)K) does not depend of r, so that exp(rX)K = K for any, i.e., 
the curve exp(r X) lies in K. This means that X € €. 


6.19 


(a) For X € € and Y © m, we have exp(tX) e€ K, whence it follows that 
Adexp.x)(Y) € m and the curve 


V(t) = Adexpaxy(Y) = e!*4*(¥) 


2 
t 
= (I +tady +5 (adx)? Saw Dg 


2 
=¥+41X, YI1+ SIX KX YI+ 


lies in the subspace m. Therefore, [X, Y] = y’(0) € m. 

(b) If X € € and Y e€ m, then Y; = [X,Y] € m, whence (ady)?(Y) = 
[X, [X, Y]] = [X, V1] € m. Similarly, (ady)”(Y) € m for all positive integers 
n. Therefore, Adexpax)(Y) = e!@dx (Y) € m forall t, and hence Ad; (Y) € m for 
any element k in the subgroup generated by exp(€). If the group K is connected, 
then this subgroup coincides with K. 


Chapter 7 ® 
Comparison Theorems, Curvature and ped 
Topology, and Laplacian 


7.1 The Simplest Comparison Theorems 


We have already proved some comparison theorems for surfaces by using Sturm’s 
theorem (see Theorem 3.26 on p. 119). Using the same theorem, we can also prove 
similar comparison theorems in the case of Riemannian manifolds, where the role 
of curvature is usually played by sectional curvatures. We will need Jacobi fields J 
on a geodesic y and the Jacobi equation 


Void = Ry’, Dy’. 


We will consider normal Jacobi fields, i.e., those orthogonal to the geodesic. 


Theorem 7.1 (comparison Theorem for Jacobi Fields) Suppose that all sec- 
tional curvatures on a Riemannian manifold M are bounded above by a constant 
c. Let y(t) be a geodesic with natural parameter on M, and let J(t) be a normal 
Jacobi field on y for which J (0) = 0. Then the following assertions hold: 


° ifc =O, then ||J(@)|| 2 t\|V)" J O)|| fort 2 0; 
© ifc = pr > 0, then ||J()|| > Rsin £||Vy/JO)|| forO <t < aR; 
° ifc=— zy <0, then ||J()|| > R sinh £I|Vy/J(O)|| fort > 0. 


Proof The function || J (t)|| is smooth at t 4 0. Let us calculate its second derivative 
by using the Jacobi equation. First, note that LJ, J) =2(V,/J, J), and hence 


eo 
ad ¢ Ae 
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Therefore, 


d° y= 2 Wyte) 
dt? ~ dt (J, J)'/2 


VID Cy VyJ) Vy I, IP 


“CN? CN CP 
RY DVD Wy FIP Wy, J? 
lv Iv Iie 


The contribution of the last two terms is nonnegative, because (V,J, J ye 
Vy J \|7|| J ||? by virtue of Cauchy’s inequality. 

The vectors J and y’ are orthogonal, and ||y’|| = 1. Therefore, the sectional 
curvature of the plane spanned by J and y’ equals 


RY WD RO DvD 
vie vl vr 


Thus, it follows from the assumptions of the theorem that 


a 
Gp lll 2 —cllJ I 


for || J || > 0. 

Now we can use Sturm’s theorem. We set y(t) = k||J(¢)|| and choose a constant 
k so that y’(0) = 1. The function y(r) satisfies the differential equation y”+a(t)y = 
0, where a(t) < c. Consider the solution of the differential equation z” + cz = 0 
satisfying the conditions z(0) = 0 and z’(0) = 1. (For c = O, this solution has the 
form z(t) = t, for c > 0, it has the form z(t) = Rsin * and for c < 0, it has the 
form z(t) = R sinh z) It can be derived from Sturm’s theorem that y(t) > z(t) if 
t > Oand z(t) # 0. The solution z(t) can vanish at a positive t only if c > 0; in this 
case, the minimum ¢ at which z(t) vanishes equals 2 R. Let us explain how Sturm’s 
theorem can be applied in this situation. To apply it, we must make sure that y(t) 
cannot vanish on the half-open interval on which z(t) > 0. For small f, the positivity 
of y(t) follows from y’(0) = 1, and for larger rt, the inequality y(t) > z(t) shows 
that y(t) cannot vanish until so does z(f). 

To obtain the required result, we must check that k = Woror 1.e., £ | (0) || = 
|| V,,, J (O)||. For this purpose, we can use Theorem 5.22: the Jacobi vector field J 
for which J (0) = 0 and V,,,J(0) = W' 9; is given in normal coordinates by J(t) = 
tW'd;. Indeed, it follows from this relation that 4 \| J(O)|| = || W!d;]]. oO 


Theorem 7.2 (comparison Theorem for Conjugate Points) Suppose that all 
sectional curvatures on a Riemannian manifold M are bounded above by a constant 


c. If c < 0, then M has no points conjugate along a geodesic, and if c = re > 0, 
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then the distance from any point to a nearest point conjugate to it along some 
geodesic is at least m R. 


Proof Let c < 0. Then, according to Theorem 7.1, any nontrivial normal Jacobi 
field J(t) for which J(0) = O does not vanish at t > 0. Moreover, if c = re > 0, 
then J(t) does not vanish atO <t <7R. oO 

Theorem 5.25 (see p. 196) describes the metric of a space of constant curvature 
c in normal coordinates. Let g-(V, V) denote the metric specified in the statement 
of this theorem for a space of constant curvature c. 


Theorem 7.3 (Comparison Theorem for a Metric) Suppose that all sectional 
curvatures on a Riemannian manifold M with Riemannian metric g are bounded 
above by a constant c. Then g(V, V) > g-(V, V) in any normal coordinate system. 


Proof As in the proof of Theorem 5.25, we decompose the vector V in two 
components, a component V” tangent to the geodesic sphere and a component V+ 
directed along a radial geodesic, i.e., perpendicular to the geodesic sphere. From the 
orthogonality of these two components it follows that 


g(V,V)=e(V", Vv") + e(Vt, V+). 


The velocity vector of a radial geodesic has the same length in the metrics g and 
&c: it equals the length of this vector in Euclidean norm associated with the normal 
coordinates. Therefore, g(V ce y= gc(V iV ). The vector V! coincides with 
the vector of some normal Jacobi field vanishing at t = 0. Applying the comparison 
theorem for Jacobi fields to this vector, we obtain g(V7, V") > #4" v’). a 


7.2 The Cartan—Hadamard Theorem 


Recall that an isometry is a diffeomorphism of Riemannian manifolds transforming 
the Riemannian metric of one of the manifold into the Riemannian metric of the 
other. A local isometry is a map of Riemannian manifolds which transforms the 
Riemannian metric of one of the manifolds into the Riemannian metric of the other. 
Any local isometry is a local diffeomorphism. 


Problem 7.1 Suppose that local isometries gy; and ¢2 from a connected Rieman- 
nian manifold M onto a Riemannian manifold N coincide at a point x € M and the 
differentials of g; and ¢ at this point coincide as well. Prove that then the maps ¢ 
and @2 coincide. 


To prove the Cartan—Hadamard theorem, we need the following lemma about 
local isometries of geodesically complete manifolds. Recall that, according to the 
Hopf-Rinow theorem, the geodesic completeness of a Riemannian manifold is 
equivalent to its topological completeness. 
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A map p: M — N of connected manifolds is called a covering if each point 
x € N has aneighborhood U such that its preimage is homeomorphic to the product 
of U and a discrete set F and the restriction of p to U x F is the natural projection 
U x F — U. Itis easy to show that, in this case, the sets F at different points x « N 
have the same cardinality. In particular, the image of the map p coincides with N. 


Lemma /f manifolds M and N are connected and M is geodesically complete, then 
any local isometry p: M — N is a covering. 


Proof Consider any point x € N and choose a positive number r so that the map 
exp, is a diffeomorphism from an open ball of radius r in T, N onto a geodesic ball 
U of radius r in the manifold NV. 

The preimage of a point x under p is a set of points y; in the manifold M. For each 
point y;, consider the image U; of an open ball of radius r in 7), M under the map 
exp,,. The local isometry p takes each geodesic in M to a geodesic in N. Therefore, 
the radial geodesics in the geodesic ball U; are mapped to radial geodesics in the 
geodesic ball U. Taking into account the geodesic completeness of the manifold M, 
we see that the map p: U; — U is a diffeomorphism. 

Now let us check that p~!(U) is contained in the union of the sets U;. Take any 
point g € U; it can be joined with the center x of the geodesic ball by a unique (up to 
parameterization) minimal geodesic y. We assume that y (0) = g and y(s) = x and 
set V = y’(0). Let g be a point in the preimage of q, i.e., such that p(q) = q. The 
space 7; M contains a unique vector V for which p»V = V. Let y be a geodesic 
from @ with initial velocity V. Since the manifold M is geodesically complete, it 
follows that y = y(s) is defined. Clearly, p(y) = x, ie., y is one of the points yj. 
Thus, g € Uj. oO 


The following example shows that the geodesic completeness assumption on the 
manifold M is essential. 


Example Consider a covering S? — RP?. Let us introduce a metric on the 
projective plane RP? so that the given covering is a local isometry. The given map 
restricted to the sphere S* minus a point is a local isometry but not a covering. 


Theorem 7.4 (Cartan—Hadamard) Let M be a complete connected Riemannian 
manifold all of whose sectional curvatures are nonpositive. Then, for any point y € 
M, the map exp,: TyM — M isa covering. In particular, the universal covering 
of the manifold M is diffeomorphic to R". Moreover, if M is simply connected, then 
M is itself diffeomorphic to R". 


Proof Since all sectional curvatures are nonpositive, it follows that the manifold 
has no points conjugate along a geodesic. In turn, this implies that the map 
exp,: TyM — M has no critical points. Therefore, the map exp,: Ty M > M 
is a local diffeomorphism. This makes it possible to construct a Riemannian metric 
on Ty M as follows. We take a point a € Ty M, consider the point exp, (a) € M, and 
lift the Riemannian metric g at this point by means of the map exp,. As a result, we 
obtain a Riemannian metric g on TM for which exp, is a local isometry. 
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To apply the lemma, we must verify that the Riemannian manifold 7,M with 
metric g is geodesically complete. First, note that the geodesics through the origin 
are straight lines, and the velocity of motion along them is constant. Indeed, these 
lines are mapped to parameterized geodesics on the manifold M. Now we can apply 
the fact that a connected Riemannian manifold is geodesically complete if it has a 
point such that the map exp is defined on the entire tangent space to this point (see 
the remark to the Hopf—Rinow theorem on p. 178). Oo 


HISTORICAL COMMENT For surfaces in Euclidean space, Theorem 7.4 was first 
proved by Hans Carl Friedrich von Mangoldt (1854-1925) in 1881. In 1898 it was 
proved independently by Hadamard. The general form of this theorem was proved 
by Cartan in 1928. 


7.3. Manifolds of Positive Curvature 


The simplest comparison theorems of Sect. 7.1 deal with the situation where the 
sectional curvature is bounded above. In the case where it is bounded below by a 
positive constant, the following statement can be proved by a similar method. 


Theorem 7.5 Suppose that all sectional curvatures on a Riemannian manifold M 
are bounded below by a positive constant c = ren Then any part of a geodesic on 
M which is longer than x R contains conjugate points. 


Proof In the proof of Theorem 7.1 we showed that the function y(t) = k||J(¢)|| 
dominates the function z(t) = RsintR. Now we have the reverse inequality, so 
that the functions y and z should be interchanged: the function y(t) = RsintR 
dominates the function z(t) = k||J(4)|| Gin the region where both these functions 
are positive). Since the function y(t) vanishes at t = 7 R, it follows that z(t) must 
vanish att <7R. oO 


We define the diameter of a Riemannian manifold to be the least upper bound for 
the distances between its points. For a sphere of radius R, the diameter thus defined 
equals zr R (the longest distance is between diametrically opposite points). 


Theorem 7.6 (Bonnet) Let M be a complete connected Riemannian manifold all 
of whose sectional curvatures are bounded below by a positive constant re Then 
M is compact and its diameter is at most 1 R. 


Proof First, we prove that the diameter of M is at most 27 R. Suppose that the 
distance between points x and y in M is greater that 7 R. According to the Hopf— 
Rinow theorem, they can be joined by a minimal geodesic. Since the length of this 
geodesic is greater than z R, it follows from Theorem 7.5 that it contains conjugate 
points. But such a geodesic cannot be minimal. This contradiction shows that the 
diameter of M is at most 7 R. 
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Now let us prove that the manifold M is compact. Take any point x € M. It can 
be joined with any point y € M by a geodesic of length at most z R. Therefore, the 
image of the ball of radius R centered at the origin under the map exp, coincides 
with M. Thus, M is the image of a compact set. Oo 


Using Bonnet’s theorem, we can prove the following statement. 


Theorem 7.7 Let M be a complete connected Riemannian manifold all of whose 
sectional curvatures are bounded below by a positive constant. Then the fundamen- 
tal group of M is finite. 


Proof Consider the universal covering p: M — M. The preimage of any point 
x € M under p is a discrete set of cardinality equal to that of the fundamental 
group. Therefore, it suffices to prove that the manifold M is compact. 

The map p can be used to lift the metric g of M to the manifold M: the lifted 
metric is defined by g = p*g. For this metric, all sectional curvatures are bounded 
below by the same positive constant. Moreover, it is complete. Indeed, the map p is 
a local isometry, so that we can take a geodesic on M, map it to a geodesic on M, 
extend the latter (using the completeness of MW), and then lift the extended geodesic 
to M. Oo 


The statement concerning the boundedness of the diameter of a Riemannian 
manifold remains valid under weaker assumptions than those of Bonnet’s theorem. 
It is sufficient to require the boundedness of the Ricci tensor rather than of 
all sectional curvatures. The assertions about the compactness of a complete 
Riemannian manifold and the finiteness of its fundamental group remain valid under 
these assumptions, and their proofs do not change. 


Theorem 7.8 (Myers) Let M be acomplete connected n-dimensional Riemannian 
manifold for which 


n—1 


arlvie 


Ric(V, V) > 


Then the diameter of M is at most 1 R. 


Proof Consider two points x and y in the manifold M. According to the Hopf— 
Rinow theorem, they can be joined by a minimal geodesic y. Suppose that the length 
of this geodesic equals /. We assume that the parameterization of this geodesic is 
natural and y (0) is its initial point. 

Let us apply the second variation formula. For a geodesic y of length / with 
velocity vector y’, the quadratic form E, is calculated by 


l 
Exx(W, W) = -| (W, (Vy) W — Ry’, W)y’)dt, 
0 


where W is the variation of the geodesic. For a minimal geodesic, we have 
Exx(W, W) 2 0. 
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Choose a basis e1, ..., €,—1 in the orthogonal complement to the vector y’(0) 
and consider its parallel transport along the geodesic. We obtain vector fields Xj (t), 
.., Xn_1(t). Fori = 1,..., — 1, consider the vector fields 


(= sin (= i +) xin, 


Each vector field X; is parallel along the gendesic, and hence V,,X; = 0. Therefore, 
Vy¥i = 7 cos (34) X; and (V,7)*¥; = —2 sin (Zt “) X;. Thus, 


l 
Exx(Vi, Yi) = -{ (Yi, Vy Yi — Ry’, Yi)y'at 


to fat) fa 
= mn |e Fr t RO", Xi, y', Xi) 


According to Problem 5.9, 


n—1 n—-1 n—1 
Ric(y’, y') = Do K(X y= 0 RX y', Xi) = — DRO, Xi y', Xi). 


i=1 i=1 i=1 


Therefore, 


I ut m2 _ 
i sin? (=) (« = I= — Ric(y’, y) dt = dX, Ex (Yi, Yi) 20 


By assumption Ric(y’, y’) > aa Therefore, 4, > a which means that / < 7 R. 


I 
oO 
HISTORICAL COMMENT Sumner Byron Myers (1910-1955) proved Theorem 7.8 
in 1941. 


7.4 Manifolds of Constant Curvature 


Theorem 5.25 on p. 196 describes the local structure of a space of constant cur- 
vature. Now, applying the Cartan—Hadamard theorem and the comparison theorem 
for conjugate points, we can also describe the global structure of a complete simply 
connected space of constant curvature (i.e., the universal covering of any complete 
space of constant curvature). 


Theorem 7.9 A complete simply connected space of constant curvature is isometric 
to either hyperbolic space, Euclidean space, or the sphere. 
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Proof To a space of constant negative or zero curvature we apply the Cartan— 
Hadamard theorem, and to a space of constant positive curvature, the comparison 
theorem for conjugate points. Consider these two cases separately. 

Let M be acomplete simply connected space of constant negative or zero curva- 
ture. Then, according to the Cartan—Hadamard theorem, the map exp, : 7; M —> M 
is a covering for each point x € M. Since M is simply connected, it follows that 
exp,: TyM — M is a diffeomorphism. Lifting the metric g of M to T,M, we 
obtain a metric g on 7,.M, with respect to which the map exp,: 7; M — M is an 
isometry. The Euclidean coordinates on 7; M are normal coordinates for the metric 
g; therefore, the metric g is defined by one of the formulas in Theorem 5.25. Thus, 
M is isometric to a hyperbolic or Euclidean space. 

Now suppose that M” is a complete simply connected space of constant positive 
curvature c = a Let us show that it is isometric to the sphere S‘p of radius R. 
Take diametrically opposite points a and —a on the sphere Sp. The map exp, is 
a diffeomorphism from an open ball of radius 7 R onto the sphere Sp minus the 
point —a. According to the comparison theorem for conjugate points, the distance 
between any conjugate points on the manifold M” is at least 7 R; hence, for any 
point x € M”, the restriction of the map exp, to an open ball of radius z R in T, M” 
is a local diffeomorphism. 

Take any isometry @q from an open ball of radius z R in T,S‘p onto an open ball 
of radius 2 R in T,M”. Consider the lifts to an open ball of radius z R in TaSr of 
the metric on the sphere (by means of the map exp,) and of the metric on M” (by 
means of the map exp, og). These are two metrics of constant curvature a on this 
ball. The Euclidean coordinates on the ball are normal coordinates for both metrics, 
because the radii of this ball are geodesics. Therefore, these metrics coincide. Thus, 
the map 


Dy = exp, ogg oexp,': S%, \ {—a} > M" 


is a local isometry onto its image. 

On the sphere Sp choose any point b different from a and —a and consider the 
map ®» defined as follows. Let ®,(b) = y. Consider the isometry gp, from an 
open ball radius z R in 7,S‘p onto an open ball of radius 2 R in Ty M" which is the 
restriction of the isometry (®,),: TpS'p > TyM". We set 


Dz = exp, og © exp, |: Sp \{-b} > M". 


This map is a local isometry onto its image. 

The maps ®, and ®, take the point b to the point y. The differentials of these 
maps at the point b coincide as well: (®q)x(b) = (@p)x = (®p)x(b). Therefore, 
according to Problem 7.1, the maps ®, and ®, coincide wherever both maps are 
defined. Gluing together these maps, we obtain a globally defined local isometry 
from Sp onto M”. Therefore, M” is compact, and we can use the fact that a 
local diffeomorphism from a connected compact manifold onto a simply connected 
manifold is a (global) diffeomorphism (see, e.g., [Pr2]). oO 
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7.5 Laplace Operator 


Let V be a vector space of dimension n equipped with an inner product and an 
orientation. Then we can define a linear operator, called the star operator, which 
maps the pth exterior power of the space V to its (n — p)th exterior power. Suppose 
that vectors €), ..., @p have unit length and are pairwise orthogonal. We complete 
them to a positively oriented orthonormal basis e1, ..., é, and set 


*(€1 A+ A @p) = Cpti A+? A en. 


To the whole space A? V the star operator is extended by linearity. 


Problem 7.2 Prove that the map **: A?V — APV is multiplication by 
(—1)P"-P), 


For an orientable Riemannian manifold M”, the star operator is locally defined 
by 


*(@1 A+++ A @p) = Opti A+++ A On, 


where the basis @1, ..., @, is dual to a positively oriented orthonormal basis. 
According to Problem 7.2, given a p-form w, we have * * @ = (—1)?"-P)a. 
Consider the operator 6 defined by 


dom = (-1)?*"™ x dxa 


(the choice of sign will become clear from Theorem 7.10 and its proof). 

The operator 6 is defined not only on an orientable, but also on a nonorientable 
manifold, because the star occurs twice in its definition. Indeed, we can take local 
coordinates. When the orientation of these coordinates changes, the star operator 
changes sign and the operator 6 remains intact. 

Now we can define the Laplace operator, or Laplacian: 


A=doé+é6éod. 


Suppose that M” is a closed manifold. Then in the space of p-forms on this 
manifold we can define (inner) product by 


jon =f w A *N. 


This product is linear, and (w,) = (n, @), because w A *n = n A xo. Ifo = 
= Gi ..ip®iy A+++ A Qi, in local coordinates, then 


2 : 
OA *O = Yo Gi,.i 1 A+++ A Qn} 


therefore, (w, w) > 0 and (w, w) = Oif and only if w = 0. 
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Theorem 7.10 For any p-form w and (p + 1)-form n, the identity (dw,n) = 
(w, dn) holds. 


Proof Let us integrate the identity 
d(@ A *n) = dw A*«n+(-1)?aAdx*n 


over the closed manifold M”. Since the integral form d(@ A *n) vanishes, it follows 
that 


(dw, n) = (—1)?! / od *n. 
mn 
For the (n — p)-form d * n, we have 
# & (dn) = (—1L)P Pd * 1; 
therefore, 
(<1)? Nd 9 = (—DP 11) POP) ed 7) 
= (-1)"?t! x (xd * n). 
By definition we have 
én = Tee xd*n= (1yr"! ed*n 
for a (p + 1)-form 7; hence 
(—1)?-!d *n = *65n. 


Thus, 


(deo) = (-0)" f orden= f w A *dn = (w, dn). 
MM” M” 


oO 


A form @ is said to be harmonic if Aw = 0. Clearly, if dw = 0 and dw = 0, then 
the form w is harmonic. Using Theorem 7.10, we can show that the converse is also 
true: if a form w is harmonic, then dm = 0 and dw = 0. Indeed, for any p-form w, 
we have 


(Aw, w) = (dda, w) + (dda, w) 
= (da, d@) + (da, do). 
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Therefore, if Aw = O, then da = O and dw = O, because (dw, dw) > O and 
(dw,dw) > 0 

For a differentiable function f on a Riemannian manifold, the Laplace operator 
can be expressed as 


af =-—a55 (vie 4). (7.1) 


where g = det(g;;). Before proving this, we remark on what the form (1) (the 
result of applying the star operator to the constant function equal to 1) is. 

Let e1, ..., @n be a positively oriented orthonormal basis, and let v1, ..., vp be 
any positively oriented basis. Then 


A+++A Un = ,/det(uj, vj)er A+++ A €n = ,/det(u;, vj) * (1). 


The metric of the cotangent space is given by (g/) = (g; ats therefore, in 
positively oriented local coordinates, we have 


#(1) = ,/det(gij)dx' A--» A dx", 


1.e., *(1) is the volume form. 
For a 0-form f, the equation df = 0 holds, and hence any compactly supported 
differentiable function ¢ satisfies the relation 


[ar eveds! ne nds" = (Af, 9) = (df, dg) 
= fas dg) * (1) 
Has wh gd! no nd" 
= a S ainiee A+++ A dx! 


This completes the proof of formula (7.1). 


7.6 Solutions of Problems 


7.1 Let A be the set of all points y € M at which the maps g and @> and their 
differentials coincide. This set is closed and nonempty; therefore, it suffices to prove 
that it is open. Take any point y € A and consider a neighborhood of the origin in 
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Ty M on which the map exp, is a diffeomorphism to a neighborhood U of the point 
y. Let us show that the neighborhood U of y is contained in A. Take z € U; we 
have z = exp, (V) for some vector V. If yy (t) is a geodesic for which yy (0) = y 
and y;,(0) = V, then yy(1) = z. The local isometry gy takes a geodesic with initial 
velocity vector V to a geodesic with initial velocity vector g,,V; therefore, 


giz) = p(w) = Ye).VO) = Ym).v ) = 92(vv (1) = G22). 


Thus, the maps g and ¢2 coincide on the entire neighborhood U, and hence their 


differentials coincide as well. 


7.2 Let e1,..., @n be a positively oriented orthonormal basis. Then 


*#(€, A+++ A€p) = *(Cpp1 A+++ A@n) =F 
Here the sign coincides with that of the orientation of the 


en. 


ré] A-:-A@p. 


basis €p+41,..., ns @1,--- 
€»- Performing p(m — p) transpositions of its elements, we obtain the basis é, ... 


> 


> 


Chapter 8 M®) 
Appendix crests 


8.1 Differentiation of Determinants 


In differential geometry it is sometimes required to differentiate the determinant 
of a one-parameter family of matrices. Below we prove two assertions about the 
differentiation of determinants, which we use in this book. 


Theorem 8.1 Let A(t) be a family of matrices smoothly depending on t, and let 
A(0) be the identity matrix. Then det’ A(O) = tr A’(0). 


Proof Choosing a basis e1, ..., €n, We can associate each matrix with an operator. 
We have 


det A(t)e, A---Ae@y = A(tyey A--- A Alten. 
Differentiating this equation, we obtain 


det’A(t)e, A+++ Aen = Y > AM er A+: A Al(Deg A+ A Aen. 
k 


For t = 0, we have 


det’ AQ) e1 A+++ Aen = Yer A+ A A'Oeg A+++ A en 
k 
=) lay O)e1 A+++ Aen = tr A'(O) er A+++ Aen; 
k 


therefore, det’ A(O) = tr A’(0). Oo 


Theorem 8.1 is a special case of Theorem 8.2. 
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Theorem 8.2 Let A(t) be a family of matrices smoothly depending on t. Then 
det’A = (det A) tr(A’A~!). 


Proof First, we prove that det’A = tr(A’ adj A’), where adj denotes the operation 
of taking the adjugate matrix. Let A;; denote the cofactor of the entry a;;. Then, 
on the one hand, tr(A’ adj A?) = vii a; Aij, and on the other hand, detA = 
aj Aij +..., where the dots denote the terms not containing a;;. Therefore, det’'A = 
qj; Aij +aij Aj a a ai, Ajj +..., where the dots denote the terms not containing 


qi. Thus, det A = vii a, Aij = tr(A’ adj A’), 
The required relation now follows from adj A! = (det A)A—!. 7 


8.2 Jacobi Identity for the Commutator of Vector Fields 


Theorem 8.3 For any vector fields X, Y, and Z on a manifold, the identity 
(LX, ¥], Z] +11, Z], X]+[[Z, X], Y] =0 


holds, which is called the Jacobi identity. 


Proof Associating each vector field X with the operator dx of function differentia- 
tion in the direction of this vector field, we associate the vector field [X, Y] with the 
operator dydy — dydx. Taking into account the associativity of the composition 
of operators, we see that the vector fields [[X, Y], Z], [[Y, Z], X], [[Z, X], Y] 
correspond to the operators 


dx Oy 0z — Oy 0x0z — Ozdx dy + Ozdy Ox, 


dy 0z0x — Ozdy dx — Ox dy dz + OxdzOy, 
0z0x 0y — OxO0zoy — Oydzox + Oy Odxdz. 


The sum of these operators vanishes. Oo 


HISTORICAL COMMENT Jacobi discovered the identity for the Poisson bracket in 
1842-1843 in studying equations of dynamics. Similar identities for vector product, 
for the commutator of vector fields, and in a Lie algebra are also called the Jacobi 
identity. 
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8.3. The Differential of a 1-Form 


On p. 77 we defined the differential of a 1-form w = fydx® as the 2-form 


af 


dw = dfy \ dx* = —dx' A dx”. 


In some situations the following expression for the differential of a 1-form is useful. 
Theorem 8.4 The differential of a 1-form w is expressed in terms of the derivatives 
of the 1-form in the directions of vector fields and the commutator of these fields as 


dw(X, Y) = dx(@(Y)) — dy(@(X)) — w([X, Y]). 


Proof We set Q(X, Y) = dx(w(Y)) — dy (@(X)) — w([X, Y]). First, we show that 
Q(eX, Y) = gQ(X, Y) for any function g. Indeed, 
(9X, Y) = dox (@(Y)) — dy (@(pX)) — w([eX, Y]) 
= 90x (W(Y)) — dyp - @(X) — gdy(@(X)) 
— po([X, Y]) + yg - a(X) 
= Q(X, Y). 


Similarly, Q(X, gY) = g@Q(X, Y). 
It es to sora that Q(X, Y) = daw(X, Y) for the coordinate vector fields 


X = x; and ¥Y = =. Clearly, fodx°(Y) = fj and dx(fadx*(Y)) = oot Since 
the coordinate vecttr nflelds commute, it follows that : 
of; 0 
Q(X, Y) = afi Ohi = dw(X,Y). 
axt  axd 
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Hopf, H. (1894-1971), 9, 178 
Hopf—Rinow theorem, 175, 247 
Horizontal space, 157 
Hypersurface, 145 

minimal, 150 
Hypocycloid, 18 


I 
Identity 
Bianchi 
first, 162 
second, 166, 182 
Jacobi, 258 
Immersed surface, 65 
Indicatrix of velocity, 53 
Inequality 
isoperimetric, 23 
Wirtinger, 25 
Infinitesimal isometry, 233 
Inflection point, 6, 26, 32 
Inner automorphism, 218 
Integral 
curve, 200 
geometry, 33 


Invariant measure, 34 
Involute, 21 
Isometry, 169 
infinitesimal, 233 
local, 247 
Isoperimetric inequality, 23 
Isothermal coordinates, 109 


J 

Jacobi 
equation, 116, 118 
identity, 258 
theorem, 122 
vector field, 116 
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Jordan, C. (1838-1922), 57 


K 
Killing vector field, 232 


Killing, W. (1847-1923), 220, 235 
Koszul, J.-L. (1921-2018), 156, 170 


Koszul’s formula, 170 


L 
Lancret, M.A. (1774-1807), 48 
Laplace—Beltrami operator, 130 
Laplace operator, 110, 129, 253 
Laplacian, 110, 253 
Left-invariant 

form, 224 

metric, 222 

vector field, 212 
Left translation, 211 
Leibniz’ rule, 153 
Lemma 

Bochner’s, 133 

Gauss’, 102, 173 

Hilbert’s, 128 


Ricci’s, 181 
Length 
affine, 28 


of a curve, 1, 66, 173 
of geodesic circle, 103 
projective, 33 


Levi-Civita connection, 148, 155, 170 


Levi-Civita’s theorem, 169 


Levi-Civita, T. (1873-1941), 91, 94, 170 


Lie 
algebra, 212 
bracket, 212 
derivative, 229 
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group, 211 
Liebmann, H. (1874-1939), 128 
Liebmann’s theorem, 128 
Lie, S. (1842-1899), 217, 232, 235 
Line 

asymptotic, 105 

of curvature, 105 
Local isometry, 247 
Loop, 199 


M 
Mainardi, G. (1800-1879), 98 
Mangoldt, H.C.F. (1854-1925), 249 
Manifold 
flat, 197 
geodesically complete, 175 
Riemannian, 169 
Map 
conformal, 109, 110 
exponential, 100, 158, 214 
spherical Gauss, 87, 110 
Matrix 
connection, 164 
curvature, 165 
Hessian, 72 
Maurer, L. (1859-1927), 225 
Maurer—Cartan 
equation, 225 
form, 224 
Mean curvature, 73, 75 
of a hypersurface, 146 
Measure invariant, 34 
Meridian, 71 
Metric 
bi-invariant, 222 
bounded, 179 
geodesically complete, 175 
left-invariant, 222 
Riemannian, 67, 169 
space complete, 177 
tensor, 67 
Metrics conformally equivalent, 178 
Meusnier, J.-B. (1754-1893), 70, 85, 109, 110 
Meusnier’s 
formula, 70, 73 
theorem, 70 
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Minding, F.G. (1806-1885), 69, 125 
Minding theorem, 124 
Minimal 
hypersurface, 150 
surface, 108, 133 
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Mukhopadhyaya, S. (1866-1937), 13 
Multiplication in Lie algebra, 212, 215 
Multiplicity of conjugate points, 116 
Myers, S.B. (1910-1955), 251 
Myers-Steenrod theorem, 233 

Myers’ theorem, 250 


N 
Natural 
equation of a plane curve, 14 
parameter, 47 
Negative tube, 15 
Nephroid, 20 
Nondegenerate curve, 26, 29 
Normal 
affine, 29 
coordinates, 101, 193 
curvature 
of a curve, 69 
of a surface, 69, 73 
principal to a curve, 47, 55, 122 
spherical image, 122 
variation, 108, 151 


O 
One-parameter subgroup, 214 
Operator 
Laplace, 110, 129, 253, 255 
Laplace—Beltrami, 130 
star, 253 
Weingarten, 85, 143 
on a hypersurface, 145 
Oriented 
area, 2 
curvature of a curve, 4 
surface, 68 
Osculating 
circle, 6, 70 
curve, 6 
plane, 52 
sphere, 52 


P 
Parabola of safety, 18 
Parallel, 71 

transport, 91 

vector field, 89, 149 
Parameter 

arc length, 3 

natural, 47 
Parameterized curve, | 
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Peterson, K.M. (1828-1881), 98 
Peterson—Codazzi equations, 98 
Plane osculating, 52 
Point 

critical, 74 

focal, 7 

of inflection, 6, 26, 32 

sextatic, 33 

umbilical, 85, 105, 127, 146 
Positive tube, 15 
Principal 

curvature, 73 

of a hypersurface, 146 

normal indicatrix, 122 

normal to a curve, 47, 55, 122 
Product pseudoscalar, 27 
Projective 

curvature, 33 

length, 33 
Pseudoscalar product, 27 
Puiseux, V.A. (1820-1883), 104 


Q 


Quadratic form 
first, 66 
on a hypersurface, 146 
second, 72, 88, 187 
on a hypersurface, 146 
third, 146 


R 
Radius of curvature, 4 
Regular curve, 47, 51,56 
Regularly homotopic curves, 14 
Restricted holonomy group, 199 
Ricci-Curbastro, G. (1853-1925), 94, 187 
Ricci’s lemma, 181 
Ricci tensor, 186 
Riemann 

curvature tensor, 98 

tensor, 171 
Riemannian 

manifold, 169 

metric, 67, 169 

bounded, 179 

submanifold, 187 
Right-invariant vector field, 212 
Rigidity, 127 
Rinow, W. (1907-1979), 178 
Rodrigues, B.O. (1795-1851), 74, 85, 88 
Rodrigues’ formula, 85, 87, 105, 147 
Rule, Leibniz’, 153 
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Scalar curvature, 187 
Schur, F. (1856-1932), 186 
Schur’s theorem, 185 
Schwarzian derivative, 31 
Second 


Bianchi identity, 166, 182 

fundamental form, 187, 191 

quadratic form, 72, 88, 187 
on a hypersurface, 146 


Sectional curvature, 184, 191 
Sequence, Cauchy, 177 

Serret, J.A. (1819-1885), 4, 48 
Sextatic point, 33 

Smooth 


curve, | 
surface, 65 


Space 


of constant curvature, 185, 195, 197 
homogeneous, 235 

horizontal, 157 

vertical, 157 


Sphere 


geodesic, 175 
osculating, 52 


Spherical Gauss map, 87, 110, 147 


of a hypersurface, 146 


Squarable figure, 2 

Steiner, J. (1796-1863), 16, 153 
Steiner’s formula, 16, 152 
Strictly convex curve, 9 

Sturm, J.C.F. (1803-1855), 119 
Sturm’s theorem, 119, 245 
Submanifold 


Riemannian, 187 
totally geodesic, 191 


Surface 


closed, 65 

convex, 129 

embedded, 65 

immersed, 65 

minimal, 108, 133 

oriented, 68 

of revolution, 68, 71, 74, 86, 89, 105, 110, 
125 

smooth, 65 


Symbols, Christoffel, 85, 101, 155, 164 
Symmetric 


connection, 148, 161 
space, 239 


Symmetry 


of Christoffel symbols, 85, 155 
of connection, 148 
of covariant differentiation, 95 
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T curvature, 9, 53 
Tait, P.G. (1831-1901), 22 oriented curvature, 7 
Tensor, 159 torsion, 53 
curvature, 160 Totally geodesic submanifold, 191 
metric, 67 Trajectory, 200 
Ricci, 186 Translation, 211 
Riemann, 171 Transport 
Riemann curvature, 98 along a curve, 156 
torsion, 155, 160 along a piecewise smooth curve, 199 
Theorem Tube, 15 
Bertrand—Diguet—Puiseux, 103 negative, 15 
Bochner’s, 133 positive, 15 
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Cartan—Hadamard, 248 


Clairault’s, 89 U 

comparison Umbilical point, 85, 105, 127, 146 
for conjugate points, 246 Unbendability, 127 
for Jacobi fields, 245 Unimodular affine transformation, 26 


for metric, 247 
Fary—Milnor, 54 


Fenchel’s, 53, 54 Vv 

on four vertices, 10 Variation, 111 
Gauss’, 76, 86, 98 normal, 108, 151 
Gauss—Bonnet, 82 vector, 111 
Hadamard’s, 129, 149 Vector field 
Hopf-Rinow, 175, 247 Jacobi, 116 
Jacobi, 122 Killing, 232 
Levi-Civita’s, 169 left-invariant, 212 
Liebmann’s, 128 parallel, 89, 149 
Meusnier’s, 70 right-invariant, 212 
Minding, 124 Vertex of acurve, 10 
Myers’, 250 Vertical space, 157 


Myers-Steenrod, 233 
on rigidity of the sphere, 128 


Schur’s, 185 Ww 
Sturm’s, 119, 245 Weingarten 
Whitney—Graustein, 14 equations, 84 

Theorema Egregium, 77, 98 formula, 188, 189 

Third quadratic form, 146 operator, 85, 143 

Torsion on a hypersurface, 145 
of a curve in many-dimensional space, 57 Weingarten, J. (1836-1910), 85 
geodesic, 69 Weyl, H. (1885-1955), 153 
of a space curve, 48 Whitney—Graustein theorem, 14 
tensor, 155, 160 Whitney, H. (1907-1989), 15 
total, 53 Wirtinger’s inequality, 25 
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